CERTIFICATE OF MAILING BY "EXPRESS MAIL 
"EXPRESS MAIL" MAILING LABEL NO.: EL47 6994461 US 

DATE of deposit: December 8 2000 

I HEREBY CERTIFY THAT THIS PAPER OR FEE IS BEING DEPOSITED WITH THE UNITED STATES POSTAL SERVICE "EXPRESS MAIL POST OFFICE TO ADDRESSEE" 
SERVICE UNDER 37 C.F.R. 1. 10 ON THE DATE INDICATED ABOVE AND IS ADDRESSED TO THE COMMISSIONER FOR PATENTS, WA^SHlS^RD^ 20231 



Mikahail Bavlev 



YPED OR PRINTED NAME OF PERSON MAILING PAPER OR FEE) 




SIGNATURE OF PERSON MAILING PA^ER OR 




APPLICATION 
for 

UNITED STATES LETTERS PATENT 
on 

CRYSTAL STRUCTURE OF WW DOMAINS AND METHODS OF USE 

THEREOF 

by 

Joseph P. Noel 
and 

Mark A. Verdecia 



Sheets of Drawings: Five (5) 
Docket No.: SALK2410 



Stephen E. Reiter 
GRAY C ARY WARE & FREIDENRICH LLP 
4365 Executive Drive, Suite 1600 
San Diego, California 92121-2189 




CRYSTAL STRUCTURE OF WW DOMAINS AND METHODS OF USE 



THEREOF 

Vn5 (W^ 



The U.S. Government may have certain rights in this invention pursuant to 
Grant No. GM57533 awarded by the USPHS. 



5 



FIELD OF THE INVENTION 



The present invention relates to crystals of WW domains and more 



particularly to the high resolution structure of Pinl WW domain obtained by X-ray 
diffraction. In addition, the invention relates to methods of using the structure 
coordinates of Pinl WW domain and mutants thereof to screen and design compounds 
10 that bind to or interact with WW domains. 

BACKGROUND 

The process of designing potent and specific inhibitors has improved with the 
arrival of techniques for determining the three-dimensional structure of the enzyme to 
be inhibited. Usually a three-dimensional model of an enzyme is produced by the 

15 creation of a crystalline form of the purified enzyme which is then subjected to X-ray 
diffraction and analysis. While such procedures provide certain valuable information 
that can be used to design inhibitors, they suffer from a lack of knowledge about the 
amino acid residues critical for interaction with a substrate or a substrate mimic. In 
order to address these limitations, enzymes have more recently been co-crystallized 

20 with substrates, substrate mimics or known inhibitors of the enzyme's activity, 
thereby allowing the important interactions to be determined (see, for example, 
Mohammadi, et al. 9 Science 276:955-960, 1997; Lee, et aL, Biochemistry 36:13180- 
13186, 1997;Brzozowski, et aL, Nature 382:753-758, 1997). 

The peptidyl-prolyl czs-trans isomerases (PPIases), or rotamases, are a family 
25 of enzymes important in protein folding, assembly and transport. They act as 

catalysts to promote isomerization about the peptidyl-prolyl bond, which can have 
profound effects on protein function. 



PPIases are divided into three classes, cyclophilins, FK-506 binding proteins 
(FKBPs) and the Pinl/parvulin class. While cyclophilins and FKBPs are 
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distinguished by their ability to bind immunosuppressant molecules cyclosporin and 
FK-506, respectively, the Pinl/parvulin class binds neither of these 
immunosupressants and is structurally unrelated to the other two classes. Known 
members of the Pinl/parvulin class include Pins 1 - 3 (Lu, et al, Nature 380:544-547, 
5 1996), Pin-L (Campbell, et al, Genomics 44:157-162, 1997), parvulin (Rahfeld, et 
al, FEBS Letts 252:180-184, 1994), dodo (Maleszka, et al, Proc Natl Acad Sci USA 
23:447-451, 1996) and Essl/Pftl (Hanes, et al, Yeast 5:55-72, 1989; and Hani, et al, 
FEBS Letts 265:198-202, 1995). 

Recent research suggests that members of the Pinl/parvulin class are essential 
10 modulators of the cell cycle, and mitosis in particular. Lu, et al, Nature 380:544-547, 
1996 (incorporated by reference herein) reports that depletion of Pinl/Essl (a 
structural and functionally related protein to Pinl) in yeast or human cells induces 
mitotic arrest followed by apoptosis, indicating that enzymes in this class serve an 
essential function in cell division and proliferation. 

1 5 The design of new, highly specific antimitotic agents represents an important 

need in the pharmaceutical industry. Such agents can serve as effective 
chemotherapeutic agents for the treatment of a variety of disorders characterized by 
inappropriate cell proliferation, including cancer and infectious diseases. The 
invention disclosed herein addresses this and related needs, as will become apparent 

20 upon review of the specification and appended claims. 

SUMMARY OF THE INVENTION 

A Pinl WW domain crystal structure is provided. Methods of using the crystal 
structure and atomic coordinates for the development of WW domain binding agents 
is also provided. In addition, the disclosure provides computer programs on computer 
25 readable medium for use in developing WW domain binding agents. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 collectively shows the overall architecture of human Pinl. 
Figure 1 A shows the ribbon representation of the Pinl-C-Terminal Domain (CTD) 
peptide complex. Residues 39 to 50 are disordered. Apostrophes distinguish the WW 
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domain from the PPIase domain. The CTD peptide backbone is in the region labeled 
as N, Pro3' and Pro6\ Residues of the CTD peptide are labeled with apostrophes. 
Atoms of carbon, nitrogen, phosphorus, oxygen, and sulfur are depicted. Dotted lines 
depict hydrogen bonds. Figure IB shows a ribbon representation of the Pin 1 -PEG 
complex. 

Figure 2 collectively shows an enlarged view of the Pinl-CTD peptide 
binding interface. Figure 2A shows a ribbon diagram of the Pinl WW domain bound 
to YpSPTpSPS depicted after a 90° rotation around a vertical axis from the view 
shown in Figure 1 A. This view is looking on the concave WW domain peptide- 
binding surface opposite the PPIase domain. The carbon atoms of the CTD peptide 
are in the foreground compared to the WW side chain atoms. The water molecule 
mediating Tyr-23 / phosphate contacts is shown as a sphere. Hydrogen bonds are 
shown as dotted spheres. Figure 2B shows the molecular surface representation of 
the WW domain - peptide interface after a slight rotation around the vertical axis from 
the view depicted in Figure 2A affords this view. The solvent-accessible surface for 
the Pinl WW domain residues was calculated in GRASP (Nicholls et aL, Proteins 
1 1 :28 1-296, 1991), and the acidic and basic residues depicted as the darker regions 
adjacent to Lys6, Glu35, Arg36, Arg21, and Argl7. 

Figure 3 shows a schematic and energetic view of the Pinl-YpSPTpSPS 
complex. Pinl residues are towards the bottom of the figure and are not covalently 
linked to the larger molecule which represents CTD. Residues participating in van 
der Waals contacts are highlighted with gray and the extended van der Waals surfaces 
appear as dotted gray curves. Hydrogen bonds are shown as dashed lines. In the case 
of the S16H and W34H mutants, some of the apparent binding is likely being 
contributed by the PPIase domain. Residues are given as single letter codes. 

Figure 4 collectively shows representative WW domains. Figure 4A shows 
the sequence alignment of 15 WW domains. The secondary structural elements are 
those of Pinl . The numbers correspond to the first residue on each line for each of the 
fifteen WW domains. The Darker gray boxes delineate residues participating in 
phosphate contacts. Shaded boxes corresponding to residues 23 and 24 define 
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residues participating in van der Waals contacts with Pro-3' of the CTD peptide. 
Shaded residues 14 and 25 contribute to the van der Waals surface sequestering the 
backbone of residues 4' to 6' as well as the Pro ring of Pro-6\ Light gray boxes 
corresponding to residue 32 define residues participating in additional hydrogen 
5 bonds with the CTD peptide. Residues participating in more then one class of 
interactions are outlined and coded as described above, h is human, Sc is 
Saccharomyces cerevisiae, d is Drosophila, m is mouse, and En is Emericella 
nidulans. Figure 4B shows stereo views of the Ca traces of the Pinl-YpSPTpSPS 
complex (top) and the YAP65-PPXY peptide complex (bottom) aligned with Pinl's 

10 WW domain. The figure is labeled as in previous figures. The backbones of the WW 
domains are now shown as coils. The arrow in the bottom panel depicts the 
movement necessary to bring the PPXY-containing peptide into alignment with the 
CTD peptide bound to Pinl. The numbering scheme for YAP65 refers to the human 
sequence. The numbers in parentheses correspond to the alternate numbering scheme 

15 used by Macias et al (Nature 382:646-649, 1996). 

Figure 5 is a block diagram of a computer system. 



DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, methods of identifying WW domain 
20 binding agents are provided. Invention methods include defining an interaction site of a 
WW domain based on a plurality of atomic coordinates of the WW domain, modeling a 
potential binding agent that fits spatially into said interaction site, contacting said 
potential binding agent with the WW domain in the presence of a WW domain substrate, 
and determining the ability of the potential binding agent to compete with the WW 
25 domain substrate for binding to the WW domain. 

In another aspect, the invention provides a computer program on a computer 
readable medium, said computer program having instructions to cause a computer to 
model a potential binding agent that fits spatially into a WW domain interaction site 
defined by a plurality of atomic coordinates. 
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WW domains 

The WW domain is a 38-40 amino acid structural motif that functions as an 
interaction module in a diverse set of signaling proteins. The modular nature of WW 
domain interactions lends itself to classification into five distinct groups based upon 
5 current understanding of their binding specificity. Group I WW domains, exemplified 
by YAP65, recognize PPxY motifs2; Group II WW domains, like FE65, bind the PPLP 
motif. Group III WW domains, such as those found in a subset of FBPs, interact with 
PGM motifs. A fourth group represented by the WW domain of Pin 1 , specifically 
interact with phosphorylated serine or threonine residues amino-terminal to proline, or 
1 0 phosphoiyiated SER _ pRO (p Ser _ Pro ) or phosphoiyiated THR _ pRO (p <T h r _p ro ) mo tifs. Finally, the 

recently classified group V WW domains recognize PGR motifs. Pinl contains an N- 
terminal WW domain and a C-terminal peptidyl-prolyl cis-trans isomerase domain 
connected by a flexible linker. Provided here are the functional and structural basis for 
WW domain binding to phosphorylated protein substrates based on the energetic and 
1 5 structural analysis of a Pinl -phosphopeptide complex. 

Pinl has been implicated in the regulation of a diverse set of mitotically 
phosphorylated proteins including Cdc25, Mytl, Weel, Plkl, and Cdc27, and also 
Taul 1 . In addition, Pinl and its homolog, Esslp, have both been implicated in the 
regulation of the C-terminal domain (CTD) of RNA Polymerase II (Pol II) and they can 

20 both bind directly to the phosphorylated CTD. Phosphorylation of Ser-2 ' and/or Ser-5 ' 
on the multiple YSPTSPS heptad repeats in the CTD of Pol II by cyclin-dependent 
kinases (CDKs) converts Pol Ha to Pol Ho and initiates a cascade of transcriptional, 
processing, and splicing events linked to the elongating polymerase. Structural and 
functional analysis of Pinl established that Pinl 's PPIase domain (which belongs to the 

25 parvulin family of PPIases) catalyzes phosphorylation-dependent peptide bond 

isomerization in P.Ser-Pro segments. Finally, binding analyses demonstrate that the 
Pinl WW domain acts as the predominant P.Ser/P.Thr-Pro binding motif. 

One aspect of the invention resides in obtaining crystals of the WW domain of 
a Pinl protein of sufficient quality to determine the three dimensional (tertiary) 
30 structure of the protein by X-ray diffraction methods. The knowledge obtained 
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concerning Pinl WW domain proteins can be used in the determination of the three 
dimensional structure of the binding domain of other WW domain containing proteins 
(e.g., Npw38 (Komuro et al 9 Nucleic Acid Res. 27(9):1957-65, 1999). The binding 
domain can also be predicted by various computer models. Based on the structural 

5 coordinates of the WW domain (i.e., the three dimensional protein structure of the 
binding domain), as described herein, small molecules which mimic the functional 
binding of a WW domain protein to its ligands can be designed and synthesized. 
Accordingly, in one embodiment, the invention provides a method of "rational" drug 
design. Another approach to "rational" drug design is based on a lead compound that 

10 is discovered using high throughput screens; the lead compound is further modified 
based on a crystal stucture of the binding regions of the molecule in question. 
Accordingly, another aspect of the invention is to provide starting materials for the 
rational design of drugs which prevent or mimic the action of a WW domain (e.g., 
Pinl WW domain) protein binding to its ligand. 

15 Pinl's amino acid sequence is known (Lu et al., supra). The WW domain of 

Pinl includes residues 1 to 39 and the PPIase domain includes residues 50-163 of 
Pinl. The term "amino acids" means the L-isomers of the naturally occurring amino 
acids or unnatural amino acids. The naturally occurring amino acids are glycine, 
alanine, valine, leucine, isoleucine, serine, methionine, threonine, phenylalanine, 

20 tyrosine, tryptophan, cysteine, proline, histidine, aspartic acid, asparagine, glutamic 
acid, glutamine, y-carboxyglutamic acid, arginine, ornithine and lysine. Unless 
specifically indicated, all amino acids referred to in this application are in the L-form. 

The term "unnatural amino acids" means amino acids that are not naturally 
found in proteins. Examples of unnatural amino acids used herein, include racemic 
25 mixtures of selenocysteine and selenomethionine. In addition, unnatural amino acids 
include the D or L forms of nor-leucine, para-nitrophenylalanine, homophenylalanine, 
para-fluorophenylalanine, 3-amino-2-benzylpropionic acid, homoarginine, and D- 
phenylalanine. 

The term "positively charged amino acid" includes any naturally occurring or 
30 unnatural amino acid having a positively charged side chain under normal 
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physiological conditions. Examples of positively charged naturally occurring amino 
acids are arginine, lysine and histidine. 

The term "negatively charged amino acid" includes any naturally occurring or 
unnatural amino acid having a negatively charged side chain under normal 
5 physiological conditions. Examples of negatively charged naturally occurring amino 
acids are aspartic acid and glutamic acid. 

The term "hydrophobic amino acid" means any amino acid having an 
uncharged, nonpolar side chain that is relatively insoluble in water. Examples of 
naturally occurring hydrophobic amino acids are alanine, leucine, isoleucine, valine, 
10 proline, phenylalanine, tryptophan and methionine. 

The term "hydrophilic amino acid" means any amino acid having an 
uncharged, polar side chain that is relatively soluble in water. Examples of naturally 
occurring hydrophilic amino acids are serine, threonine, tyrosine, asparagine, 
glutamine, and cysteine. 

15 The term "crystal structure coordinates" refers to mathematical coordinates 

derived from mathematical equations related to the patterns obtained on diffraction of 
a monochromatic beam of X-rays by the atoms (scattering centers) of a protein 
molecule in crystal form. The diffraction data are used to calculate an electron 
density map of the repeating unit of the crystal. The electron density maps are used to 

20 establish the positions of the individual atoms within the unit cell of the crystal. The 
crystal structure coordinates of the Pinl protein binding domain (e.g., the WW 
domain) are obtained from a Pinl protein crystal having orthorhombic space group 
symmetry P2 X 2 X 2 X with a = 35.27 A, b = 43.90 A, c = 124.66 A, a = p = y = 90°. The 
coordinates of the Pinl protein binding domain can also be obtained by means of 

25 computational analysis. 

The term "selenomethione substitution" refers to the method of producing a 
chemically modified form of a crystal of Pinl. The Pinl protein is expressed by 
bacteria in media that is depleted in methionine and supplemented with 
selenomethionine. Selenium is thereby incorporated into the crystal in place of 
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methionine sulfurs. The location(s) of selenium are determined by X-ray diffraction 
analysis of the crystal. This information is used to generate the phase information 
used to construct a three-dimensional structure of the protein. 

The term "heavy atom derivatization" refers to the method of producing a 
5 chemically modified form of the crystal of Pin 1. A crystal is soaked in a solution 
containing heavy metal atom salts or organometallic compounds, which can diffuse 
through the crystal and bind to the surface of the protein. The location(s) of the bound 
heavy metal atom(s) are determined by X-ray diffraction analysis of the soaked 
crystal. This information is used to generate the phase information used to construct a 
10 three-dimensional structure of the protein. 

The term "unit cell" refers to the basic parallelipiped shaped block. The entire 
volume of a crystal may be constructed by regular assembly of such blocks. 

The term "space group" refers to the arrangement of symmetry elements of a 

crystal. 

15 The term "molecular replacement" refers to a method that involves generating 

a preliminary model of a Pinl crystal whose structure coordinates are not known, by 
orienting and positioning a molecule whose structure coordinates are known. Phases 
are then calculated from this model and combined with observed amplitudes to give 
an approximate Fourier synthesis of the structure whose coordinates are known. 

20 The crystal structure coordinates of a Pinl protein and its WW domain can be 

used to design compounds that bind to the protein and alter its physical or 
physiological properties in a variety of ways. The structure coordinates of the protein 
can also be used to computationally screen small molecule data bases for compounds 
that bind to the protein to develop competitive, uncompetitive and non-competitive 

25 inhibitors of Pinl WW domain. 

A "competitive" inhibitor is one that inhibits a WW domain activity (e.g., a 
Pinl WW domain activity) by binding to the same kinetic form, of the WW domain, 
as its substrate binds-thus directly competing with the substrate for the active site of 
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the WW domain. Competitive inhibition can be reversed completely by increasing 
the substrate concentration. 

An "uncompetitive" inhibitor is one that inhibits WW domain activity (e.g., a 
Pinl WW domain activity) by binding to a different kinetic form of the active site 
5 than does the substrate. Such inhibitors bind to a WW domain already bound with the 
substrate and not to the free enzyme. Uncompetitive inhibition cannot be reversed 
completely by increasing the substrate concentration. 

A "non-competitive" inhibitor is one that can bind to either the free or 
substrate bound form of a WW domain. 

10 Those of skill in the art may identify inhibitors as competitive, uncompetitive 

or non-competitive, by computer fitting enzyme kinetic data using standard equations 
according to Segel, I. H., Enzyme Kinetics, J. Wiley & Sons, (1975). It should also be 
understood that uncompetitive or non-competitive inhibitors according to this 
invention may bind to an accessory binding site. 

1 5 Methods of using crystal structure data to design inhibitors of enzyme activity 

are known in the art. Thus, the crystal structure data provided herein can be used in 
the design of new or improved enzymatic inhibitors. For example, the Pinl WW 
domain coordinates, provided herein, can be superimposed onto other available 
coordinates of similar enzymes which have inhibitors bound to them to give an 

20 approximation of the way these and related inhibitors might bind to Pinl WW 

domains. Alternatively, computer programs employed in the practice of rational drug 
design can be used to identify compounds that reproduce interaction characteristics 
similar to those found between the Pinl WW domain and the co-cry stalized substrate 
mimic. Furthermore, detailed knowledge of the nature of binding site interactions 

25 allows for the modification of compounds to alter or improve solubility, 
pharmacokinetics, etc. without affecting binding activity. 

Computer programs are widely available that are capable of carrying out the 
activities necessary to design compounds using the crystal structure information 
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provided herein. Examples include, but are not limited to, the computer programs 
listed below: 

Catalyst Databases™ - an information retrieval program accessing chemical databases 
such as BioByte Master File, Derwent WDI and ACD; 

5 Catalyst/HYPO™ - generates models of compounds and hypotheses to explain 
variations of activity with the structure of drug candidates; 

Ludi™ - fits molecules into the active site of a protein by identifying and matching 
complementary polar and hydrophobic groups; 

Leapfrog™ - "grows" new ligands using an algorithm with parameters under the 
10 control of the user. 

In addition, various general purpose machines may be used with programs 
written in accordance with the teachings herein, or it may be more convenient to 
construct more specialized apparatus to perform the operations. However, preferably the 
embodiment is implemented in one or more computer programs executing on 
15 programmable systems each comprising at least one processor, at least one data storage 
system (including volatile and non-volatile memory and/or storage elements), at least 
one input device, and at least one output device. The program is executed on the 
processor to perform the functions described herein. 

Each such program may be implemented in any desired computer language 
20 (including machine, assembly, high level procedural, or object oriented programming 
languages) to communicate with a computer system. In any case, the language may be a 
compiled or interpreted language. The computer program will typically be stored on a 
storage media or device (e.g., ROM, CD- ROM, or magnetic or optical media) readable 
by a general or special purpose programmable computer, for configuring and operating 
25 the computer when the storage media or device is read by the computer to perform the 
procedures described herein. The system may also be considered to be implemented as a 
computer-readable storage medium, configured with a computer program, where the 
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storage medium so configured causes a computer to operate in a specific and predefined 
manner to perform the functions described herein. 

Embodiments of the invention include systems (e.g., internet based systems), 
particularly computer systems which store and manipulate the coordinate and sequence 

5 information described herein. One example of a computer system 1 00 is illustrated in 
block diagram form in Figure 5. As used herein, "a computer system" refers to the 
hardware components, software components, and data storage components used to 
analyze the coordinates and sequences set forth in Table 1. The computer system 100 
typically includes a processor for processing, accessing and manipulating the sequence 

1 0 data. The processor 105 can be any well-known type of central processing unit, such as, 
for example, the Pentium III from Intel Corporation, or similar processor from Sun, 
Motorola, Compaq, AMD or International Business Machines. 

Typically the computer system 100 is a general purpose system that comprises 
15 the processor 105 and one or more internal data storage components 110 for storing data, 
and one or more data retrieving devices for retrieving the data stored on the data storage 
components. A skilled artisan can readily appreciate that any one of the currently 
available computer systems are suitable. 

20 In one particular embodiment, the computer system 100 includes a processor 105 

connected to a bus which is connected to a main memory 115 (preferably implemented 
as RAM) and one or more internal data storage devices 110, such as a hard drive and/or 
other computer readable media having data recorded thereon. In some embodiments, the 
computer system 100 further includes one or more data retrieving device 118 for reading 

25 the data stored on the internal data storage devices 110. 

The data retrieving device 118 may represent, for example, a floppy disk drive, a 
compact disk drive, a magnetic tape drive, or a modem capable of connection to a 
remote data storage system (e.g., via the internet) etc. In some embodiments, the internal 
3 0 data storage device 1 1 0 is a removable computer readable medium such as a floppy disk, 
a compact disk, a magnetic tape, etc. containing control logic and/or data recorded 
thereon. The computer system 100 may advantageously include or be programmed by 
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appropriate software for reading the control logic and/or the data from the data storage 
component once inserted in the data retrieving device. 

The computer system 100 includes a display 120 which is used to display output 
5 to a computer user. It should also be noted that the computer system 100 can be linked 
to other computer systems 125a-c in a network or wide area network to provide 
centralized access to the computer system 100. 

Software for accessing and processing the coordinate and sequences of Table 1, 
1 0 (such as search tools, compare tools, and modeling tools etc.) may reside in main 
memory 115 during execution. 

For the first time, the present invention permits the use of molecular design 
techniques to design, select and synthesize chemical entities and compounds, including 
15 inhibitory compounds, capable of binding to the active site or accessory binding site of a 
WW domain (e.g., a Pinl WW domain), in whole or in part. 

One approach enabled by this invention, is to use the structure coordinates set 
forth in Table 1 to design compounds that bind to the enzyme and alter the physical 
properties of the compounds in different ways, e.g., solubility. For example, this 

20 invention enables the design of compounds that act as competitive inhibitors of the WW 
domains by binding to, all or a portion of, the active site of a WW domain. This 
invention also enables the design of compounds that act as uncompetitive inhibitors of 
the WW domain {e.g., the WW domain of Pinl). These inhibitors may bind to, all or a 
portion of, the active site of a WW domain. Similarly, non-competitive inhibitors that 

25 bind to and inhibit a WW domain whether or not it is bound to another chemical entity 
may be designed using the structure coordinates of the invention as set forth in Table 1 . 

In another approach a Pinl WW domain crystal is probed with molecules 
composed of a variety of different chemical entities to determine optimal sites for 
interaction between candidate binding molecules {e.g., inhibitors) and the WW domain 
30 active site. 
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In another embodiment, an approach made possible and enabled by this 
invention, is to screen computationally small molecule data bases for chemical entities or 
compounds that can bind in whole, or in part, to the WW domain. In this screening, the 
quality of fit of such entities or compounds to the binding site may be judged either by 
5 shape complementarity or by estimated interaction energy. Meng, E. C. et al.,J. Comp. 
Chem., 13, pp. 505-524 (1992). 

A number of WW domains exist, many of which have similar functional activity. 
However, many WW domains may crystallize in more than one crystal form. Thus, the 
structure coordinates of Pinl WW domain, or portions thereof, as provided by this 
10 invention are particularly useful to solve the structure of other crystal forms of WW 
domains. They may also be used to solve the structure of a WW domain or Pinl WW 
domain mutants, WW domain co-complexes, or the crystalline form of any other protein 
with significant amino acid sequence homology to any WW domain. 

One method that may be employed for this purpose is molecular replacement. In 
15 this method, the unknown crystal structure, whether it is another WW domain crystal 
form, a WW domain or Pinl WW domain mutant, or a WW domain co-complex, or the 
crystal of some other protein with significant amino acid sequence homology to any 
WW domain, may be determined using the structure coordinates as provided in Table 1. 
This method will provide an accurate structural form for the unknown crystal more 
20 quickly and efficiently than attempting to determine such information ab initio. Those 
of skill in the art understand that a set of structure coordinates determined by X-ray 
crystallography is not without standard error. 
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TABLE 1 



Atom 


Atom 
Type 


Residue 


# 


X 


Y 


Z 


OCC 


B 


1 


C 


GLY 


1 


.297 


6 .261 


77 . 129 


1 . 00 


34 . 99 


2 


O 


GLY 


1 


. 701 


5 .443 


77 . 948 


1.00 


35 . 67 


3 


N 


GLY 


1 


-1 . 549 


6.552 


75 . 579 


1.00 


35.60 


4 


CA 


GLY 


1 


- .294 


5 .819 


75 . 824 


1 .00 


35.38 


5 


N 


SER 


2 


.327 


7 . 566 


77 .321 


1 .00 


34 . 27 


6 


CA 


SER 


2 


. 851 


8 . 106 


78 . 547 


1 . 00 


33 .27 


7 


CB 


SER 


2 


1. 393 


9.518 


78 .304 


1.00 


33.43 


8 


OG 


SER 


2 


.397 


10 .429 


77 . 856 


1 . 00 


34 . 33 


9 


C 


SER 


2 


- . 346 


8 . 118 


79.496 


1 . 00 


32 . 42 


10 


0 


SER 


2 


-1.486 


8 . 145 


79 . 045 


1 . 00 


32 . 01 


11 


N 


HIS 


3 


- . 093 


8.099 


80 .798 


1.00 


31.53 


12 


CA 


HIS 


3 


-1 . 153 


8.083 


81.796 


1.00 


30.70 


13 


CB 


HIS 


3 


- . 530 


8 .274 


83 . 186 


1.00 


30 . 18 


14 


CG 


HIS 


3 


-1.523 


8.316 


84 .304 


1 . 00 


29 . 99 


15 


CD2 


HIS 


3 


-2 .043 


7.337 


85 . 077 


1.00 


29. 88 


16 


ND1 


HIS 


3 


-2 . 102 


9.493 


84 .732 


1.00 


29. 83 


17 


CE1 


HIS 


3 


-2 . 936 


9.228 


85 . 729 


1 .00 


29.76 


18 


NE2 


HIS 


3 


-2 . 915 


7 . 930 


85 . 954 


1 . 00 


29 . 65 


19 


C 


HIS 


3 


-2 .261 


9 . Ill 


81 . 546 


1 . 00 


30 . 30 


20 


0 


HIS 


3 


-3 .428 


8 . 818 


81.767 


1.00 


30.00 


21 


N 


GLY 


4 


-1 . 899 


10 .292 


81.053 


1.00 


30. 06 


22 


CA 


GLY 


4 


-2 . 891 


11.330 


80 .812 


1 .00 


30.26 


23 


C 


GLY 


4 


-3 .491 


11. 392 


79 .417 


1 . 00 


30 . 62 


24 


0 


GLY 


4 


-3 . 962 


12 .445 


78 . 990 


1.00 


30 . 36 


25 


N 


MET 


5 


-3 .484 


10.272 


78 . 702 


1.00 


31. 06 


26 


CA 


MET 


5 


-4 . 029 


10 .239 


77 . 349 


1 . 00 


32 . 14 


27 


CB 


MET 


5 


-2 . 900 


10 . 330 


76 . 320 


1 .00 


33 . 56 


28 


CG 


MET 


5 


-1 . 877 


11.379 


76 . 665 


1 . 00 


35 . 79 


29 


SD 


MET 


5 


-1.479 


12 . 502 


75 .352 


1.00 


39.48 


30 


CE 


MET 


5 


-2 . 891 


13 . 512 


75.389 


1. 00 


38. 05 


31 


C 


MET 


5 


-4 . 816 


8 . 967 


77 . 107 


1 . 00 


32 . 01 


32 


0 


MET 


5 


-4 .592 


7 . 954 


77 . 770 


1.00 


31.57 


33 


N 


ALA 


6 


-5.743 


9.024 


76 . 156 


1 . 00 


32 . 09 


34 


CA 


ALA 


6 


-6.544 


7.858 


75 . 823 


1.00 


32 . 75 


35 


CB 


ALA 


6 


-7.620 


8 .224 


74 .808 


1.00 


32 .26 


36 


C 


ALA 


6 


-5.593 


6 . 840 


75 .226 


1 . 00 


33 . 51 


37 


O 


ALA 


6 


-4.579 


7.208 


74 . 625 


1 . 00 


32 . 76 


38 


N 


ASP 


7 


-5 . 900 


5.561 


75 .408 


1.00 


34 . 83 


39 


CA 


ASP 


7 


-5 . 050 


4.523 


74 . 847 


1.00 


36 . 77 


40 


CB 


ASP 


7 


-5 . 191 


3 .201 


75 .574 


1 . 00 


36 . 60 


41 


CG 


ASP 


7 


-4 . 345 


3 . 133 


76 .794 


1 . 00 


36.46 


42 


OD1 


ASP 


7 


-3 .300 


3 . 815 


76.824 


1.00 


36 . 44 


43 


OD2 


ASP 


7 


-4.722 


2 .392 


77.717 


1.00 


36. 77 


44 


C 


ASP 


7 


-5 .426 


4 .277 


73 .431 


1 . 00 


38.24 


45 


O 


ASP 


7 


-6.399 


4 . 833 


72 . 926 


1 .00 


38.39 


46 


N 


GLU 


8 


-4 . 675 


3 .410 


72 . 778 


1 . 00 


39 . 91 


47 


CA 


GLU 


8 


-5 . 037 


3 . 176 


71.420 


1.00 


41.61 


48 


CB 


GLU 


8 


-5 . 005 


4 .516 


70 . 665 


1 . 00 


42 .47 


49 


CG 


GLU 


8 


-3 . 630 


5 . 196 


70 . 661 


1 . 00 


43 . 94 


50 


CD 


GLU 


8 


-3 . 700 


6 . 679 


70 .399 


1 . 00 


44 . 72 


51 


OE1 


GLU 


8 


-3 .249 


7 . 133 


69.317 


1.00 


45.25 
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52 


OE2 


GLU 


8 


-4.209 


7.410 


71.267 


1 . 00 


45 . 66 


53 


c 


GLU 


8 


-4.3 50 


2.129 


7 0 . 591 


i no 

X . VJ VJ 


42 . 09 


54 


o 


GLU 


8 


- 3 . 3 95 


1 . 444 


7 0 9 o q 


i rin 

X . U VJ 


AO 9 7 


55 


N 


GLU 


9 


-4 . 94 0 


2.032 


69 .418 


X • VJ VJ 


AO 9 q 


56 


CA 


GLU 


9 


-4 . 606 


1.219 


68.258 


i nn 

x . vj vj 


4 i fin 

*± J . D VJ 


57 


CB 


GLU 


9 


-3 . 972 


2 . 124 


67 . 2 0 0 


i no 

X . VJ VJ 


44.31 


58 


CG 


GLU 


9 


-3 . 671 


3 . 542 


67 . 5 6 9 


i nn 

X . VJ VJ 


A R fil 


59 


CD 


GLU 


9 


-4.4 96 


4.518 


66 . 73 8 


i n n 


4 fi RO 
*± o . 3 vj 


60 


OE1 


GLU 


9 


-5.535 


4 . 977 


67.2 55 


i n n 

X . VJ VJ 


4 7.27 


61 


OE2 


GLU 


9 


-4.138 


4 . 816 


65 . 573 


1.00 


4 7.08 


62 


c 


GLU 


9 


-3 . 933 


- . 102 


68 . 005 


1 . 00 


4 3.44 


63 


o 


GLU 


9 


- 3 .42 0 


-.838 


68 . 845 


i n n 

x . vj vj 


4 1 fi R 


64 


N 


LYS 


10 


-3 . 912 


- . 276 


66 . 690 


i n n 

X . VJ VJ 


41 1ft 

*± 3 . J o 


65 


CA 


LYS 


10 


-3.326 


-1.359 


65 . 963 


i n n 

X . VJ U 


49 fin 

*± Z . D VJ 


66 


CB 


LYS 


10 


-4.3 63 


-2 . 078 


65 . 091 


i n n 

x . vj vj 


41 9 n 

*± 3 • VJ 


67 


CG 


LYS 


10 


-5 . 105 


-3.24 8 


65 . 6 8 8 


i n n 

X . VJ VJ 


4 1 ft 1 

*± 3 . O 3 


68 


CD 


LYS 


10 


-6.580 


-2 . 8 91 


65 . 718 


1.00 


44 . 18 


69 


CE 


LYS 


10 


-7 . 473 


-3.925 


65 .065 


i nn 

X . VJ VJ 


44 

*± *± • 3 3 


70 


NZ 


LYS 


10 


- 8 . 832 


-3.33 9 


64 . 817 


i n n 

X . VJ VJ 


44.81 


71 


C 


LYS 


10 


-2 .521 


-.539 


64 . 995 


i n n 

X . VJ VJ 


41.77 


72 


o 


LYS 


10 


-1.396 


-.087 


65.24 8 


i n n 

X . VJ VJ 


41 Qft 


73 


N 


LEU 


11 


-3.210 


-.290 


63.8 94 


i n n 

X . VJ VJ 


4 n 1 R 


74 


CA 


LEU 


11 


-2.678 


. 4 02 


62 . 758 


i n n 

X . VJ VJ 


"3 ft Q7 

JO . J / 


75 


CG 


LEU 


11 


-2 . 843 


2.889 


63 . 092 


i n n 

X . VJ VJ 


7 ft Qfi 


76 


CD1 


LEU 


11 


-4 . 12 8 


2.512 


fi 1 ft 1 4 

O 3 . O X *± 


i n n 

X . U VJ 


1 ft P 9 


77 


CD2 


LEU 


1 1 


-2 .222 


4 141 


fi 1 fiflc; 

O 3 . DOj 


i n n 
x . u u 


TO r A 
JO ■ DU 


78 


c 


LEU 


11 


-1.778 


-.681 


\j • £i X £i 


i n n 

X . VJ VJ 


17 fi ft 


79 


o 


LEU 


11 


- . 725 


- . 986 


62 . 7 7 8 


1.00 


17 QR 


80 


CB 


LEU 


11 


-1.911 


1.672 


63 . 158 


i n n 

X . VJ VJ 


■JO. _7rt 


81 


N 


PRO 


12 


- 2 .216 


-1.315 


6 1 . 12 6 


1.00 


1 fi 04 


82 


CD 


PRO 


12 


-3.360 


- . 954 


60.273 


1.00 


1 R QO 

J 3 * J VJ 


83 


CA 


PRO 


12 


-1.421 


-2.381 


60.519 


1.00 


34.36 


84 


CB 


PRO 


12 


-2 . 183 


-2.692 


5 9. 236 


1.00 


14 ft ft 
3 *± . o o 


85 


CG 


PRO 


12 


-3 .597 


-2 . 23 9 


5 9.553 


1.00 


i r in 


86 


c 


PRO 


12 


- . 049 


- 1 . 8 01 


6 0 . 24 0 


i n n 

X . VJ VJ 


19 fi7 
J Z • o / 


87 


0 


PRO 


12 


. 151 


- . 586 


60.33 0 


i n n 

X . VJ VJ 


19 1 Q 


88 


N 


PRO 


13 


. 92 9 


-2 . 652 


5 9 . 924 


i n n 

X . VJ VJ 


ii n ft 

3 X . U O 


89 


CD 


PRO 


13 


. 963 


-4 . 125 


5 9. 872 


i n n 

X . VJ VJ 


in ft q 

3 vj . O 27 


90 


CA 


PRO 


13 


2.248 


-2 . 0 84 


5 9 . 654 


1.00 


9 Q R Q 


91 


CB 


PRO 


13 


3 . 042 


-3.2 93 


5 9 . 165 


1.00 


10 1 fi 

J U . ID 


92 


CG 


PRO 


13 


2,438 


-4.419 


59 . 971 


1.00 


3 0.57 


93 


c 


PRO 


13 


2 . 140 


_ . 985 


5 8.593 


1.00 


9ft 99 


94 


o 


PRO 


13 


1.302 


- 1 . 062 


57 . 68 9 


1-00 


2 7.79 


95 


N 


GLY 


14 


2 . 966 


. 04 7 


5 8 . 73 7 


1.00 


9 fi 74 


96 


CA 


GLY 


14 


2 . 970 


1.140 


57.783 


1.00 


2 5.25 


97 


c 


GLY 


14 


2.066 


2.317 


5 8 . 0 92 


i n n 

X . VJ VJ 


94 94 


98 


o 


GLY 


14 


2.264 


3.403 


R 7 R R 1 

3 / . 3 3 3 


i n n 

X • VJ VJ 


9 1 Q9 


99 


N 


TRP 


15 


1 .081 


2 . 124 


58 . 960 


1 . 00 


23.51 


100 


CA 


TRP 


15 


. 160 


3 .205 


59. 286 


1 .00 


22 . 91 


101 


CB 


TRP 


15 


-1 . 275 


2 . 679 


59. 368 


1 . 00 


22 . 23 


102 


CG 


TRP 


15 


-1 . 845 


2 .209 


58 . 065 


1 . 00 


21 . 67 


103 


CD 2 


TRP 


15 


-2 . 524 


3 . 010 


57 . 092 


1 . 00 


21.24 


104 


CE2 


TRP 


15 


-2 . 898 


2 . 152 


56. 032 


1 . 00 


21 . 30 


105 


CE3 


TRP 


15 


-2 . 853 


4 . 371 


57 . 011 


1 . 00 


20 . 98 


106 


CD1 


TRP 


15 


-1 . 832 


. 935 


57 . 573 


1 . 00 


21.71 


107 


NE1 


TRP 


15 


-2 . 462 


. 893 


56. 352 


1 . 00 


21.40 
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10 8 


CZ2 


TPP 

X x\_ XT 


1 t: 
-L P 


1 C O Q 

- J . boo 


O iT AO 

z .buy 


54 . 902 


1.00 


21.06 


10 9 


CZ3 


X xvir 


1 Q 

X 3 




>1 o o q 

4 . o z y 


55 . 884 


1.00 


21.18 


110 




TPP 


T C 
X P 


*2 O Q D 


3 . 94 6 


54.84 5 


1.00 


21 . 04 


111 


Q 


TPP 


X P 


. 4 / u 


j . yb4 


60 . 575 


1.00 


23.13 


112 




TPP 
X xv ir 


X P 


t n *j o 


3.402 


61.519 


1.00 


22 . 54 


113 


N 


fTT.TT 


X O 


n q c 


p . z^ z 3 


60 . 5 96 


1.00 


23.37 


114 


CA 


rjT TT 
\J±J\J 


X D 


O C *7 


o.Uob 


61.763 


1.00 


24 . 62 


115 


CB 


HUT 


X o 




b . / y u 


61 . 698 


1.00 


24.57 


116 


CG 


rjT.TT 


X O 


T n Q Q 
X . / o o 


7.743 


60.532 


1.00 


24 . 62 


117 


CD 


vjXiu 


X D 


*5 o n n 


Q "31 1 


60 .44 8 


1.00 


24 . 61 


118 


OE1 


fiT.TT 


X o 


A 114 
ft . X X4 


/ . p y o 


59 . 988 


1.00 


24.24 


119 




VjXjU 


T C 
X D 


1 1 Q C 

j . j y b 


9.470 


60.857 


1.00 


24 . 99 


12 0 




(TT.TT 


X D 


Q Q A 

- . o y u 


7 . 063 


61.812 


1.00 


25.27 


121 


n 


m tt 


T £C 
X D 


T /I O O 

-1.4S / 


7.384 


60.7 84 


1 . 00 


24 . 68 


122 


N 


T,VC 


1 H 
X / 


- X . Z Z4 


7.526 


63.012 


1 . 00 


26.55 


12 3 


CA 


LYS 


1 "7 
X / 


Z . JU / 


Q /I O /T 


63 . 173 


1.00 


28.35 


124 


CB 


T,vq 
Li I O 


1 1 
X / 




O T o r 
O . Z O 5 


64 . 502 


1.00 


2 9.42 


12 5 


CG 


T,vq 

Xj X o 


X / 


~ 4 . U4 Z 


9.410 


64 . 769 


1.00 


31.85 


126 


CD 


LYS 


1 1 
X / 


- 4 . j4o 


9.476 


66 .2 04 


1.00 


33.36 


127 


CE 


LYS 


1 H 
X / 


_ cr "5 n R 


O . Z Z O 


66 .5 99 


1.00 


34.45 


12 8 


NZ 


T.VQ 


X / 


-ft . J / / 


7 . 166 


67 . 041 


1.00 


36 . 19 


129 




LYS 


X / 


_ -1 O Q T 
X . / O X 


y . y iu 


63 . 142 


1.00 


2 9.19 


13 0 


o 


T.VQ 
Xj X O 


1 "7 
X / 


c r n 


T /*t TOO 

1U . loo 


63 . 617 


1.00 


2 8.99 


131 


N 


7\pn 


1 ft 
X O 


*"> C *7 O" 
- Z . P / / 


10.818 


62.593 


1.00 


3 0.37 


132 


CA 


ARG 


1 ft 
X o 


_ O 1 Q4 
Z . X O ft 


xz . z 1 b 


62 .53 6 


1.00 


31.87 


133 


CB 


apn 


1 p 
X O 


1 . 4 jo 


12 .482 


61.233 


1.00 


32 . 64 


134 


CG 




1 fl 
X O 


O *3 O 
- . Z J O 


11 . 563 


61.069 


1 . 00 


34 . 24 


135 


CD 


ARG 


1 ft 
X o 


A Q fl 

. ft y o 


11 . bib 


59 . 7 91 


1.00 


35.78 


13 6 


NE 




1 R 
X O 


i . ±yb 


T T inn 

13 . 1 0 y 


5 9.859 


1.00 


37.48 


137 


CZ 


ARG 


1 R 
X O 


Z . D X / 


13.230 


5 9. 884 


1.00 


37.91 


138 


NH1 


ARG 


1 ft 
X o 


J . U O O 


1 A A O O 

14 . 4 z o 


59 . 952 


1.00 


38.43 


13 9 


NH2 




1 R 
X O 


j . z y X 


TO T C /I 
lz . 154 


59 . 83 0 


1.00 


38.32 


14 0 


Q 




1 R 
X O 




13 . 12 7 


62 . 672 


1 . 00 


32.46 


141 


o 




1 ft 
X o 


/I COT 
ft . 3J1 


lz . DP O 


62 . 697 


1.00 


32.20 


142 


N 


MPT 


T Q 

x y 


- O . 14 D 


14 .42 8 


62 . 777 


1.00 


33.73 


143 


CA 


MPT 


X -? 


- ft . z z 4 


15 .3 96 


62 . 922 


1.00 


35 . 12 


144 


CB 


MPT 


1 Q 
x y 


/i n ft q 
- ft . u £3 y 


i /* t /r o 

lo . loo 


64 .23 6 


1.00 


36.75 


145 


CG 


MPT 


X y 


- ft . O ft z 


T C /I C T 

lb . 4 5 7 


65 .4 52 


1.00 


3 9.07 


146 


SD 


MET 


1 Q 

X -7 


p . z u D 


x b . o 1 1 


66 . 73 5 


1.00 


41.46 


147 


CE 


MET 


x 


p . 1 J J 


T Q O C\ 1 
X o . Z U 3 


cz cz o c cr 
o b . OOP 


1.00 


41.00 


148 


c 


MET 


X 27 


ft . z y j 


1 C A f\ CZ 

lb : 4 0b 


61,7 90 


1.00 


3 5.20 


149 


o 


MET 


1 Q 
x y 


j . z o y 


T C O O C 
lo . O O O 


61.3 06 


1.00 


35.22 


150 


N 


O JZi x\. 


z u 


P . P Xft 


-\ cz o o o 
lb . / z / 


61.381 


1.00 


35.36 


151 


CA 


SER 


Z U 


P . / ft U 


17 . 7 05 


60 . 32 8 


1.00 


35 . 66 


152 


CB 


QPP 


z u 


*7 T Q -1 

- / . Xo4 


17 .60 9 


5 9.825 


1.00 


35.65 


153 


OG 


QPP 


z u 


/ . o / p 


TO o o cr 
1 O . O / b 


59.4 14 


1.00 


35.43 


154 


c 


SER 


Z u 


- P . ft o U 


T Q AQO 

i y . u y o 


60 .891 


1.00 


36.12 


155 


0 


SER 


20 


-6.091 


19.496 


61.883 


1.00 


O 3 . O D 


156 


N 


ARG 


21 


-4 . 557 


19. 828 


60 .272 


1 . 00 


36 . 76 


157 


CA 


ARG 


21 


-4 .238 


21 . 186 


60 . 711 


1 . 00 


37.41 


158 


CB 


ARG 


21 


-3 . 125 


21 . 796 


59 . 844 


1 . 00 


38 . 05 


159 


CG 


ARG 


21 


-1.694 


21.415 


60 .217 


1 . 00 


39 . 57 


160 


CD 


ARG 


21 


-1 . 064 


20.438 


59.224 


1.00 


41 . 03 


161 


NE 


ARG 


21 


-1 . 805 


20 . 360 


57 . 971 


1 . 00 


42 .01 


162 


CZ 


ARG 


21 


-1.230 


20 . 319 


56 . 770 


1 . 00 


42 . 70 


163 


NH1 


ARG 


21 


. 094 


20.355 


56 . 660 


1 .00 


43 . 34 
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164 


NH2 


ARG 


9 1 
z X 


1 O ""7 O 

- x . y / z 


2 0.225 


55 . 672 


1 . 00 


43.27 


165 


c 


ARG 


9 1 

Z X 


_ C A Q Q 


22 . 053 


60 . 579 


1.00 


37.21 


166 


0 


ARG 


9 1 
Z X 


c: 7 A 1 
3 - / U X 


22 . 983 


61.354 


1 . 00 


37.49 


167 


N 


SER 


9 9 

Z Z 


O . 3 X X 


21.729 


59 . 589 


1.00 


36.91 


168 


CA 


SER 


9 9 

Z Z 


_ •-} COT 
~ / - 3Z / 


22 . 4 73 


59.303 


1.00 


36 . 54 


169 


CB 


SER 


9 9 

Z Z 


- 7 ocn 

/ . y 3 U 


2 2.201 


57 . 856 


1.00 


37.25 


170 


OG 


SER 


9 9 
z z 




22 . 541 


57 . 643 


1.00 


37.97 


171 


c 


SER 


9 9 
Z Z 


_ Q *7 *"> C 
O . /ZD 


22 . 2 65 


60 . 227 


1 . 00 


35.96 


172 


0 


SER 


9 9 
Z Z 


- y . 3 Z 6 


2 3 .23 5 


60 . 690 


1.00 


36.13 


173 


N 


SER 


9 *3 
Z J 
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- y . u / 6 


21.012 


60.496 


1.00 


34.79 


174 


CA 


SER 


9 *5 
Z j 


i a T/i n 
- 1 U . z 4 U 


2 0.720 


61.330 


1.00 


33 . 65 


175 


CB 


SER 


9 *5 
Z J 


- 1 1 . 16 3 


19.756 


60.584 


1.00 


33 . 79 


176 


OG 


SER 


9 ^ 
Z J 


- 1 U . 3 14 


18 . 517 


60 . 357 


1.00 


34.00 


177 


Q 


ODD 


9 "X 
Z j 


- y . y b / 


2 0 . 14 8 


62 . 715 


1.00 


32 . 65 


178 


o 


SER 


9 

Z J 


- 1U . 03 3 


2 0 . 100 


63.558 


1.00 


32.30 


179 


N 


GLY 
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Z 4 


Q TOT 
- O . / Z 3 


19 . 716 


62 . 951 


1.00 


31 . 54 
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CA 


GLY 
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Z 
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- O . JOJ 


19 . 13 0 


64 .236 


1 . 00 


30.56 


181 


c 


GLY 


9 A 
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O . O 3 / 
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1.00 


29.75 
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o 


GLY 
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Z 4 


Q TAT 


16.981 


65.276 


1 . 00 


29 . 78 


183 


N 
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9 R 
Z 3 


Q A A C 


17.2 51 


63 . 169 


1 . 00 


28.87 


184 


CA 


Apr; 


9 c: 
Z 3 


- 9 . 963 


15.896 


63 . 019 


1.00 


28 . 00 


185 


CB 


ARG 


9 c. 
Z 3 


- 1U . y 3 6 


15.8 84 


61.838 


1 . 00 


29 . 15 


186 


CG 


ARG 


9 R 
Z 3 




14 .576 


61.522 


1 . 00 


30.49 


187 


CD 


ARG 


9 R 

Z 3 


1 O ET >1 O 

- lz . 34 o 


14 . 810 


60.33 0 


1.00 


31 . 64 


188 


NE 


ARG 


9 R 

Z 3 


_ 1 *5 i c i 

X 3 . lOJ 


13 . 5 92 


59.806 


1 . 00 


32 . 91 


189 


CZ 


ARG 


9 r 

Z 3 


- x 3 . / y D 


13 .531 


58.645 


1.00 


32 . 93 


190 


NH1 


ARG 


9 R 

Z 3 


_ 1 *> Q1 l 

x 3 . y x 3 


14 .617 


57.8 92 


1 . 00 


33 . 35 
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NH2 


ARG 


9 R 

Z 3 


- 1 A TIC 
14 . 3 X D 


xZ . 3 o4 


58.232 


1.00 


33 . 50 
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C 


ARG 


9 R 

Z 3 


_ q one 


1/1 Q1 C 

14 . y 16 


62 . 783 


1 . 00 


27.17 


193 


o 


ARG 


9 R 

Z 3 


/ . O 4 O 


15.235 


62.081 


1.00 


26.80 


194 


N 


VAL 


9 ^ 
Z D 


o . o y 4 


13 . 732 


63 .380 


1 . 00 


25 . 90 


195 


CA 


VAL 


Z D 


- 7 QC1 
/ . O 3 1 


TO Tin 

12 .717 


63 . 22 3 


1 . 00 


25 .40 


196 


CB 


V7AT. 

V .rtXj 


9 £C 
Z O 


- / . y4 3 


11.64 0 


64 . 341 


1 . 00 


25 .46 


197 
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9 ^ 

z o 


"7 ACQ 
- / . U 3 O 


10 .451 


64.006 


1 . 00 


25 . 92 


198 


CG2 


VAL 


9 ^ 
Z O 


7 C O /I 
/ . 3Z4 


12 .233 


65 . 675 


1.00 


25 . 82 


199 


c 


VAL 


9 ^ 
z o 


/.you 


12 . 007 


61.877 


1 . 00 


24 . 87 


200 


o 


VAL 


9 £ 

Z D 


- y . u 3 4 


11 . 824 


61.353 


1.00 


25 .20 


201 


N 


TYR 


9 7 

z / 


_ <r pi q 

o . £3 x y 


11 . 6z 1 


61.3 13 


1 . 00 


24.40 


202 


CA 


TYR 


2 7 


D . uU J 


1 A O Q CZ 
1 U . O O O 


60 . 051 


1.00 


23.42 


203 


CG 


TYR 


9 7 
z / 


3 . 4 4 Z 


12 . 563 


58.5 82 


1 . 00 


24 . 14 


204 


CD1 


TYR 


2 7 


-A A"\A 
t ■ 4,54 


11 Q O C 

ii . y o 3 


[— <-J OA'S 

3 7 . 8 03 


1.00 


24 . 61 


205 


CE1 


TYR 


9 7 
z / 


- 3 .ZOO 


12 . 688 


57 . 4 96 


1 . 00 


24 . 81 


206 


CD 2 


TYR 


9 7 
z / 


3 . Z 4 3 


13 .85 8 


5 9.055 


1 . 00 


24 . 39 


207 


CE2 


TYT? 

1 IK 


9 7 
Z / 


A A Q >1 

-4 . U o4 


14 .567 


58 . 758 


1 . 00 


25 . 05 


208 


CZ 


TYR 


9 7 
z / 


_ O T A A 


13 .981 


57.97 8 


1 . 00 


25.21 


209 


OH 


TYR 


9 7 
z / 


- 1 Q 

x . y O Z 


14 . 698 


5 7.683 


1 . 00 


25.76 


210 


c 


TYR 


9 7 
z / 


3 . O Z 4 


y . y z 5 


60 . 069 


1 . 00 


23.02 


211 


O 


TYR 


27 


-4 . 774 


9 . 999 


60 . 960 


1.00 


22 . 89 


212 


CB 


TYR 


27 


-6 . 746 


11.836 


58 . 839 


1 . 00 


23 . 85 


213 


N 


TYR 


28 


-5.575 


9 . 014 


59 . 103 


1 . 00 


21 . 88 


214 


CA 


TYR 


28 


-4 . 504 


8 . 034 


59 . 074 


1.00 


21 . 16 


215 


CB 


TYR 


28 


-5 . 095 


6 . 632 


59 . 223 


1 . 00 


22 .27 


216 


CG 


TYR 


28 


-5 . 914 


6.520 


60 .488 


1 .00 


23 .24 


217 


CD1 


TYR 


28 


-7.210 


7 . 032 


60 . 550 


1.00 


24 . 01 


218 
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28 


-7 . 926 
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61.745 


1 . 00 


24 . 83 
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28 


-5.358 


5 . 997 


61 . 652 


1 . 00 


23 . 92 



18 



220 
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O • U D Z 


D . yyo 
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1 . UU 
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CZ 
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& R 1 Q 
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. O 27 X 


i n n 
1 . uu 


Z 3 . 1 / 
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OH 
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- 8 . 019 
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1 . v u 
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Z O . 3 U 


223 


c 


TYR 


2 8 
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ft 1 "JQ 
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1 . U u 
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ft 
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3D. / Z 3 
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19.01 
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CA 
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29 


• *± O X 
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7 7 


Q A ft ft 


3D . 3 U O 
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1 ft A ^ 




C A 7 ft £T 
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1 . UU 
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cz 
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29 
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N 
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CA 
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C. 9 ^ 9 
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1 . U U 


17.50 


238 


CB 


ASN 
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54 . 4 64 


1.00 


17.63 


239 


CG 


ASN 
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n ai 

• / *± X 




34 .zo / 


*1 A A 
1 . 00 


17 . 60 


240 


OD1 


ASN 
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1 Q1 ^ 


1 O 7 7 
J . U Jz 


c ^ o n o 

3 3 . 2»U O 


1.00 


17.34 


241 


ND2 


ASN 
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0 ft ft 

. u o o 


1 ft Q 7 


34 . 3 O 3 


1.00 


18.35 


242 


c 


ASN 


3 0 


1 . 92 8 


R *7 ft 1 
3 . /OX 


33.1/1/ 


1.00 


17.59 


243 


0 


ASN 


3 0 


X . 27 / 3 


A~\ A 
O . 4 X4 


3 3 . 3 03 


1 A A 

1 . 00 


16 . 77 


244 


N 


HIS 


3 1 


i on ^ 


3 . jj / 


33. / / 4 


1.00 


17 . 16 


245 


CA 


HIS 


3 1 


4 ^ n 

*± . 3 U O 


^ n "7 r 

O . U / 3 


3 3.3// 


1 . 00 


17.86 


246 


CB 


HIS 


3 1 


5.202 


D.J XZ 


3 O . O U / 


1.00 


18.00 


247 


CG 


HIS 


3 1 


R Zt HA 


R TOR 
3 . 1U3 


5 7 .470 


1.00 


18.54 


248 


CD 2 


HIS 


3 1 


6 . 042 




3 / . £ 6 4 


1.00 


18.99 


249 


ND1 


HIS 


3 1 


*± . 27 3 3 


r n a i 

3 . U 4 1 


3 O . / / Z 


1.00 


18 . 72 


250 


CE1 


HIS 


3 1 


5.310 


ft ft 7 


3 27 . 3 U 1 


1 . 00 


19.47 


251 


NE2 


HIS 


3 1 


5 . 971 


^ 1 Q"3. 


C ft "3 Q O 
3 0.300 


1.00 


18 . 96 


252 


c 


HIS 


3 1 


5 . 052 


R 9 R Q 
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34 . 3 Z O 


1 . 00 


17 . 93 


253 


0 


HIS 


3 1 


6.205 


R C. A 
-J . 3 3 ft 


C /I n i n 
34 . U 1 27 


1.00 


18 . 12 


254 


N 


ILE 


32 
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A 9 A 1 
ft . Z ft J 


cr o Tin 
3 3. / / 3 


1 . 00 


17.83 


255 


CA 


ILE 


32 


5 . 006 


3 . re Z 3 


CO TOO 
3 . / Z 27 


1.00 


17 . 67 


256 


CB 


ILE 


32 


4 . 772 


1 QAQ 
X . 27 V 27 


3 Z . 27 b O 


1 . 00 


18.12 


257 


CG2 


ILE 


32 


5.434 


X . \J 27 / 


CI OCA 
3 1 . O 3 4 


1.00 


17 . 77 


258 


CGI 


ILE 


32 


5.357 


X . *± 27 Z 


c A T51 
3 4 . 3 Z 1 


1 . 00 


18.34 


259 


CD1 


ILE 


32 


5.138 


n 9 7 

. U Z / 


34 . b 3 b 


1.00 


19.67 


260 


c 


ILE 


32 


4-382 


^ , 8 9. 7 


q 1 ft "7 


1.00 


17.44 


261 


0 


ILE 


32 


5.078 


J . 27 D u 


3 u . 3 / y 


1.00 


17.36 


262 


N 


THR 


33 


3 . 067 


4 fl 7 R 

t; . U J J 


CI O ft c 
3 1 . 3 O 3 


1 . 00 


17.20 


263 


CA 


THR 


33 


2.348 


4 A 9 R 
*± . 4 Z 3 


3U . 1 / U 


1.00 


17.33 


264 


CB 


THR 


3 3 


. 955 


^ 7 7 c. 
3 . / / 3 


3 U . 1 U 27 


1.00 


17.42 


265 


OG1 


THR 


33 


. 134 


4 . J 3 O 


c: T 1 /i i 
3 1 . 14 1 


1.00 


17.75 


266 


CG2 


THR 


3 3 


1.050 


9 9 71 

^ . Z / X 


c r\ ;i q 

3U . 3 1 27 


-I A A 

1.00 


18.24 


267 


C 


THR 


33 


2 . 131 


5 . 939 


50 . 135 


1.00 


17.18 


268 


0 


THR 


33 


1 . 838 


6 . 512 


49 . 085 


1 . 00 


16.30 


269 


N 


ASN 


34 


2 .270 


6 . 568 


51 .299 


1 . 00 


17 .25 


270 


CA 


ASN 


34 


2 . 075 


8 . 002 


51.467 


1.00 


17.74 


271 


CB 


ASN 


34 


3 . 024 


8 . 795 


50 . 559 


1 . 00 


18 . 75 


272 


CG 


ASN 


34 


4 .464 


8 . 737 


51. 047 


1 . 00 


20 . 18 


273 


OD1 


ASN 


34 


4 . 712 


8 .518 


52 . 236 


1 . 00 


21.35 


274 


ND2 


ASN 


34 


5 .416 


8 . 947 


50 . 144 


1 . 00 


20.59 


275 


C 


ASN 


34 


. 629 


8 .443 


51.262 


1.00 


17 .42 
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276 


o 


ASN 


34 


.347 


9 (SOI 


tri n i / 
3 X . U J ft 


X . U U 


17.41 


111 


N 


ALA 


3 5 


-.291 


7 4 91 


3 X . j /X 


1 fifi 
1 . U U 


1 O OA 

X / . o4 


278 


CA 


ALA 


3 5 


- 1 . 711 


7 7 99 
/ . / y z 


CI 9 9 A 
3 X . Z Z fl 


i n r\ 
X . U U 


TO O O 


279 


CB 


ALA 


3 5 


_ 9 a c o 

Z • *± 3 o 


G. c c 7 


3 U . / ft X 


i An 
1 . UU 


TO TO 

lb . Iz 


2 8 0 


c 


ALA 


7 c 

J 3 


-9 "jnn 

Z . J v u 


ft 9 £ 9 
O . Z O Z 


CO ceo 
3Z . 33 J 


1.00 


19.15 


2 81 


o 


ALA 


7 C 
J 3 


"1 Q A 7 

X . Oft / 


""7 Q A Q 


5 3 . 62 0 


1 . 00 


18.60 


2 82 


N 


SER 


7 £ 


_ 0 -2 A Q 
j . J U _7 


9 lOQ 

y . x z y 


CO A O O 

3Z .4 / z 


1.00 


19 . 8 1 


2 83 


CA 


SER 


7 


_ -\ QQQ 


9 C 1 

_7 . O 3 X 


co ccn 

3 J . O 3 U 


1 . u u 


21.25 


2 84 


CB 


SER 


3 6 


_ o poo 
J . o z o 


11 17 9 
XX. X / Z 


CO O C O 
3 J . / 3 Z 


1 . 00 


21.07 


2 85 


OG 


SER 


3 6 


- 9 A ft £ 


x X . j jU 


c n o c 
34 . U ju 


1.00 


2 2.28 


286 


c 


SER 


7 


_ c A Q O 
3 . rt O / 


Q 7 9 C 
y . J Z 3 


CO C/t 1 
3 J . 34 X 


1.00 


22 . 07 


2 87 


o 


SER 


7 fl 


_ £T no c 
D . U / 3 


9 A AH 
y . 4 4 / 


CO /i /r o 


1 . 00 


21.66 


288 


M 


GLN 


3 7 




ft 91 n 

O • _7 X U 


c a ^ c n 

3 ft . D 3 U 


T A A 

1 . UU 


23.28 


289 


CA 


GLN 


3 7 


- 7 c i £ 

/ . 3 -L D 


ft c o. 

O . 3 0 O 


C A £Z C Q 

34 . oo y 


1 . 00 


24 . 65 


2 90 


CB 


GLN 


7 7 


- 7 7 7 9 
/ • / «5 ;7 


7 1 1 9 


C /l O T 7 


1.00 


24 . 61 


291 


CG 


GLN 


3 7 


- 7 fi ft 4 


£ 119 

O - X X -7 


C C 9 C Q 
3 3 . Z D y 


-» A A 

1 . 00 


24 . 96 


2 92 


CD 


GLN 


7 7 




A ^ 7 C 
ft . D / O 


C A QCO 

3 4 . ooo 


1 A A 

1 . 00 


2 5.43 


293 


OE1 


GLN 


3 7 


- 7 . 116 


4 9 ft 9 
f± . Z O y 


CO TT/I 
3 J . / Z 4 


T A A 

1 . 00 


25 . 92 


294 


NE2 


GLN 


7 7 


- 7 ft 1 A 
/ . O X ft 


O q «7 o 

o . o / z 


C C O 1 A 
33 . Ol4 


1 . 00 


2 6.05 


2 95 


c 


GLN 


3 7 


- ft IDA 
O . J. \J *± 


ft ft ft 1 
o . o O X 


3 o . U 4 / 


T A A 

1.00 


25 . 98 


296 


o 


GLN 


3 7 


- 7 414 


ft 7 7 C 

O . / / 3 


CO c\ C Q 


-1 A A 

1 . 00 


2 5 . 84 


297 


N 


TRP 


3 8 


- 9 7 ft 1 


9 9 C 9 

-7 . Z 3 Z 


3 D . U / D 


1 A A 

1 . UU 


O O "~3 O 
Z / . /Z 


298 


CA 


TRP 


3 8 


-in n 7 a 


9 RCR 


CO" OOO 
3 / . .3 Z Z 


1 . 00 


2 9.72 


299 


CB 


TRP 


3 8 


-11.405 


in 9 £ ft 

X w .zoo 


C 7 n 9 A 
3 / . U Z 4 


1 A A 

X . u u 


O 0 O Q 

z y . z y 


300 


CG 


TRP 


3 8 


- 1 1 9 7 7 


11 £4 7 


C £T A £ 1 
3D . 4 O X 


1 A A 

1 . u u 


OA 1 A 

z y . 14 


301 


CD 2 


TRP 


7 ft 


X VJ . ODD 


19 11 C 
XZ . / / O 


CO r» Q O 

3 / . u y z 


T A A 
1 . UU 


2 8.85 


302 


CE2 


TRP 


3 8 


-in ft H9 

x u . o \j z 


17 ft 7 9 
X O . O / Z 


C £Z o n Q 

3 d . z u y 


1 A A 
1 . U 0 


2 8 . 94 


303 


CE3 


TRP 


7 ft 


-in n i 9 
x u . U X Z 


XZ . Z7 / / 


5 8.316 


1 . 00 


2 9.21 


304 


CD1 


TRP 


7 ft 


- 1 1 7 A C 
X X - / ft 3 


19 nflc 

X z . U O 3 


c c o c o 

3 3 . Z 3 / 


1.00 


2 9.21 


305 


NE1 


TRP 


7 ft 


- 1 1 4 £ 
XX . *± O D 


17 A 9 1 
X J . ft Z X 


c c n cn 

3 3 . u y / 


-1 A A 

1 . u u 


2 9.11 


306 


CZ2 


TRP 


7 ft 


-in ^ i n 

XL/ • J X U 


X 3 . X ^ 3 


c a c r\ q 

3b . buy 


"1 A A 
1 . UU 


2 8.81 


3 07 


CZ3 


TRP 


7 ft 


- Q C 9 1 
-7 . 3 Z X 


1 A 9 A A 
X*t . Z ^fc ft 


C O CIO 

jo . bio 


1 . 00 


2 8.69 


3 08 


CH2 


TRP 


7 ft 


-7 . D / 3 


t c in 

ID . jIZ 


CO 1 1 r 

3 / . / 15 


1.00 


2 8.99 


309 


c 


TRP 


3 8 


-in 7^9 

X VJ . j O Z 


ft 7 7 £ 

U . J JO 


C Q 1 O O 

bo.l/i 


-I A A 

1 . u u 


31.49 


310 


0 


TRP 


3 8 


-in 9 n n 


ft 1 CA 
O . J 3*t 


C Q O Q O 

by . j y z 


T A A 
1 . U U 


3 1 . 14 


311 


N 


GLU 


3 9 


-in ft n i 

x yj . o u i 


7 9 7 n 
/ . z / u 


CO C 1 Q 

b / . bio 


-l A A 
X . U U 


3 3.88 


312 


CA 


GLU 


3 9 


- 1 1 i c n 

XX . X 3 U 


^ n 7 7 


C Q 1 Q C 

bo . iy b 


-l A A 
1 . U U 


3 6.66 


313 


CB 


GLU 


3 9 


- 1 1 9 7 1 

XX . J O X 


c: i a 9 

3 . Xfi Z 


CO ooc 
b / . z z b 


1 A A 

1 . u u 


3 7.87 


314 


CG 


GLU 


3 9 


- 1 1 7 7 9 

XX. / J Z 


7 

J . D ft 3 


CO 7 Q 9 

b / . 6 y z 


-i a A 

X . u u 


A A T T 
4 U . 1 1 


315 


CD 


GLU 


7 9 


-in Q9 9 
X U . 17 Z Z 


7 n a c: 

J . U ft 3 


cr ez o C c 

b b . z b b 


1 . 00 


41.50 


316 


OE1 


GLU 


3 9 


-in 977 


i ft n 9 
X . o u z 


C £>. HOC 


i n q 


42 .21 


317 


OE2 


GLU 


7 9 

O Z7 


-in 9 ^ 7 


7 ft 1 ft 


C C COT 

b b . bz 1 


1 A A 

1.00 


42.41 


318 


c 


GLU 


3 9 


- 9 9^ n 


c; 9 ft 7 

3 . Z O O 


bO. / D 4 


1 A A 

X . u u 


O O O IT 


319 


0 


GLU 


3 9 


- ft 99 ft 

O • -7 Z O 


c; 17 7 
J • 1 J J 


co i n a 
3 o . X U4 


-\ A A 
1 . U U 


3 7.68 


320 


N 


ARG 


4 0 


-in n 9 7 

X VJ . U _7 / 


4 ft 9 1 
ft . O Z X 


ez n r\ n o 
OU . u U z 


T A A 
X . U U 


3 9.22 


321 


CA 


ARG 


4 0 


-y • V X D 


a n ^ ^ 

ft . U D O 


o U . bUU 


1.00 


41.26 
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CB 


ARG 
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-9.346 


7 7 4 


o A 9 o 
DZ . UZ / 


1 A A 

X . u u 


/IT Q O 

4 1 . y 7 


323 


CG 


ARG 


40 


-8 .215 


2 . 858 


62.681 


1.00 


43.45 


324 


CD 


ARG 


40 


-8 .414 


2 .794 


64 . 176 


1 . 00 


44 . 68 


325 


NE 


ARG 


40 


-9.631 


2 . 070 


64 . 530 


1.00 


46.28 


326 


CZ 


ARG 


40 


-10 . 399 


2 . 374 


65. 571 


1. 00 


47.26 


327 


NH1 
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40 


-10 . 079 


3 .399 


66.349 


1 . 00 


47 . 90 


328 


NH2 
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40 


-11.465 


1.635 


65 . 863 


1 . 00 


47 . 94 


329 


C 
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40 


-8.889 


2 . 847 


59 . 709 


1 . 00 


42 . 22 


330 


0 
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40 


-9 . 829 
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59. 568 


1 . 00 
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N 
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41 
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CD 
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CA 
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CB 
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4 1 
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1 f\f\A 


3 / . / 3 3 


1 fin 


A "3 Q Q 
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335 


CG 


PRO 


4 1 


- 5 .4 52 


9 91 c; 

Z . X 3 


R ft Q7"3 
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1 nn 
x . v u 


A 7 R 7 
*fc 3 . 3 Z 


336 


c 


PRO 
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- 7 . 74 7 


. Z 27 3 


3 27 . u / 0 


i on 


A C. £ 7 
ft 3 . O / 


337 


O 
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- 7 Q 

• / O 27 


3 O . D X ft 


1 . uu 


fx 3 . 27 O 
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N 


SER 


42 


- 7 .4 72 
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AH icq 
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CA 


SER 


42 
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- 4 R ft 
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O X . ft X 27 


i nn 
x . u u 


Aft AT 
ft O . ft X 


340 


CB 


SER 


42 


-8.015 


- 1 . 84 8 


6 0 . 914 


i n n 

X . VJ u 


Aft 7 ^ 
ft D . / D 


341 


OG 


SER 


42 


- 7 . 942 
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£ 1 Q41 
OX . 27 ft X 


1 n n 
x . u u 


A Q OI 
ft 27 . Z X 
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SER 


42 


- 6 . 08 0 


. J-7D 
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O X . O / ft 
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x . u u 
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ft O . 27 ft 
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O 


SER 


42 
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3 5.500 


1.00 


46 . 94 


482 


CB 


ARG 


72 


2 2 .259 


00 0 1 CZ 
Zj . ZlD 


3 6.7 94 


1.00 


48.06 


483 


CG 


ARG 


72 


9 1 £ ft4 


9 9 COC 


30 . 0 0 5 


1.00 


4 9.63 


484 


CD 


ARG 


72 


9 1 4 7 7 
z 1 . ^ / / 


ZJ . 3 X O 


3 9.070 


1.00 


50.70 


485 


NE 


ARG 


72 


2 0.485 


9 7 noi 
Z 3 . U 27 X 


4 U . U4 1 


1.00 


51.89 


486 


CZ 


ARG 


72 


2 0 .430 


Z 3 . DUX 


/ 1 OCT 

4 X . z oz 


1.00 


52.35 


487 


NH1 


ARG 


72 


21.324 


9 4 con 
Z ft . DZU 


41 . 5 95 


1.00 


52 . 65 


488 


NH2 


ARG 


72 


19 4 ft 9 


9 7 9 "5 n 
Z 3 . Z jU 


A *"> TOO 

4 z . X z 0 


1.00 


52.66 


489 


c 


ARG 


72 


2 3 . 062 


9 9 4 7 Q 
Z ^ . *± / 27 


14 cm 

3 4 . DUX 


1 A A 

X . u 0 


46.51 


490 


O 


ARG 


72 


24 . 184 


22 . 798 


35 . 024 


1.00 


46 . 64 


491 


N 


ARG 


73 


22 .814 


22 . 081 


33 .360 


1 . 00 


45 . 96 


492 


CA 


ARG 


73 


23 . 879 


21 . 899 


32 .398 


1.00 


45.39 


493 


CG 


ARG 


73 


24 . 151 


19 . 544 


31 . 589 


1 . 00 


47 . 17 


494 


CD 


ARG 


73 


23 .266 


18 .375 


31.298 


1.00 


47 . 97 


495 


NE 


ARG 


73 


23 . 914 


17 . 121 


31.650 


1. 00 


48 . 93 


496 


CZ 


ARG 


73 


23 .247 


16 . 008 


31 . 924 


1 . 00 


49 .41 


497 


NH1 


ARG 


73 


21.919 


16 . 010 


31.889 


1 . 00 


49 . 70 


498 


NH2 


ARG 


73 


23 . 903 


14 . 894 


32 .224 


1.00 


49 . 54 




23 



499 


c 


ARG 


73 


24 - 2 99 


2 3 . 18 7 


7 1 7 64. 

3 X . / D ft 


1.00 


44.29 


500 


o 


ARG 


73 


2 3 . 573 


24 . 172 


31.815 


1.00 


A A 1 Q 
ft ft . X 2? 


501 


CB 


ARG 


73 


2 3 .446 


9 0 ft 6 9 
Z U • o o z 


7 1 7 A 7 


1 00 

X . VJ \J 


A 6 n 7 


502 


N 


PRO 


74 


25 .508 


O 7 "I Q4 
Z 3 • X 27 ft 


7 1 9 07 

3 X . / 


1 0 0 

X . VJ VJ 


A 7 9 Q 


503 


CD 


PRO 


74 


26 .4 63 


22 .081 


31.3 12 


1.00 


43 . 00 


504 


CA 


PRO 


74 


2 6 . 121 


z *± . j o 3 


7 O t:d 9 

J VJ . D t Z 


1 00 
x . vj \J 


A 9 9 7 
1Z . z ^ 


505 


CB 


PRO 


74 


2 7 .530 


9 7 ft 9 R 
z j . o z 3 


7 n 9 7 7 
J U . Z J / 


1 OO 
X . vj u 


A 9 6ft 
1 Z . 0 0 


506 


CG 


PRO 


74 


27 . 756 


9 9 ft 1 R 

Z Z • O JLD 


7 1 

O -L . 3 ft 3 


1 0 0 

X . VJ VJ 


A 7 n ft 

*± 3 . VJ O 


507 


c 


PRO 


74 


25.3 65 


24 . 817 


OQ 7 10 
Z 27 . J X Z 


1 0 0 

X . VJ VJ 


4 1 4 9 

rt X • *± Z 


508 


o 


PRO 


74 


24 .6 05 


2 4 . 0 71 


9ft 6 ft 6 
z 0 . 0 0 0 


1.00 


4 0.74 


509 


N 


SER 


7 5 


2 5.596 


9 6 0 ft 6 

Z D • V O O 


z 0 . y 27 13 


1 0 0 


4 0 7 0 

ft VJ . 3 U 


510 


CA 


SER 


75 


2 4 .978 


9 6 77Q 

Z D . / / 27 


9 7 ft 7 1 
Z / . O / X 


1 0 0 
x . u u 


7 Q 6 c: 
3 -7 . D 3 


511 


CB 


SER 


75 


2 5.718 


2 8.0 96 


9 7 6 R9 
z / . 0 3 z 


1 0 0 

X . VJ \J 


7 9 Aq 

3 27 . *± 3 


512 


OG 


SER 


75 


2 7.103 


2 7 . 84 9 


2 7 .475 


1 0 0 


7ft 6 Q 


513 


c 


SER 


7 5 


24 . 919 


9 6 n 7 n 


9 6 R A 9 
Z O . 3 ft Z 


i no 

X . U U 


7 Q O Q 

3 J? . u y 


514 


o 


SER 


75 


2 3 . 947 


2 6 . 156 


Z 3 . ouu 


1 0 0 

X . VJ \J 


7 Q 9 9 
3 _7 . Z Z 


515 


N 


SER 


76 


25.964 


25.2 68 


2 6 . 243 


1.00 


7 ft 4 R 
3 0 . *± 3 


516 


CA 


SER 


76 


26 . 042 


24 .536 


24 . 983 


1.00 


7 7 9 0 

3 / . 27 VJ 


517 


CB 


SER 


76 


27 . 432 


2 3 . 9 07 


9 J. ft 7 ft 

Z ft . OOO 


1 0 0 

X . \J VJ 


7 7 ft 6 
3 / . 0 D 


518 


OG 


SER 


76 


2 7 . 655 


2 2.916 


9 R ft 9 R 

Z 3 • OZ J 


1 O O 
X . VJ vj 


7 7 4 1 
3 / . ft X 


519 


Q 


SER 


7 6 


24 . 974 


9 7 ACIP 
Z J . ft 3 O 


9 A 7 7 9 
Z ft . / / Z 


1 0 0 
x . U U 


7 1 7Q 
3 / . / O 


52 0 


o 


SER 


76 


24 .695 


9 7 0 7 9 

Z 3 m VJ / Z 


9 7 6 7 6 

Z J . U J 0 


1 0 0 
X . U u 


7 7 OA 
3 / . U ft 


521 


N 


TRP 


77 


24.3 76 


22 . 980 


oq R R Q 

Z 3 . O 3 -7 


1 O O 

X . \J VJ 


7ft n 1 
3 O . U X 


522 


CA 


TRP 


7 7 


2 3 .3 53 


91 Q^Q 

Z X . 7 J ? 


779 
Z 3 . / / Z 


i on 
x . u u 


7 fl 7 Q 
3 O . 3 i7 


523 


CB 


TRP 


7 7 


2 3 . 104 


91 7 91 
z j. • 0 z j. 


9 7 147 
Z / • Xt* / 


i 0 0 


3 O . O -7 


524 


CG 


TRP 


7 7 


24.2 02 


9 0 A 7 6 
Z VJ . ft 3 0 


9 7 6 HA 

Z / . O U ft 


1 0 0 


7 Q 1 7 
3 i7 . X 3 


525 


CD 2 


TRP 


77 


24.190 


19.007 


9 7 69 0 
z / . oz u 


1 0 0 


7 Q R A 
3 J7 . 3 ft 


526 


CE2 


TRP 


77 


25 .453 


18.5 83 


2 8 084 


1 on 

X . VJ U 


7 Q 6 7 
3 j7 . D 3 


527 


CE3 


TRP 


77 


23 .235 


18.03 9 


2 7.281 


1 on 

X . VJ u 


7 Q "7 ft 
3 27 . / 0 


528 


CD1 


TRP 


77 


25 .432 


2 0 . 816 


9 R o^n 

Z O . U jU 


1 0 0 
x . vj u 


7 Q AT 
3 27 . ft X 


529 


NE1 


TRP 


77 


26.193 


19.710 


9ft 741 

z 0 . J fx X 


1 0 0 
x . u u 


7 Q £ n 
3 27 . 0 U 


530 


CZ2 


TRP 


77 


25.790 


17 .235 


2 8.223 


1 0 0 

X . \J VJ 


7 Q Q7 
3 27 . 27 3 


531 


CZ3 


TRP 


7 7 


23.567 


1 6 69R 


9 7 41 Q 
Z / . ft X 17 


t n n 


7 Q Q Q 
3 27 . 27 27 


532 


CH2 


TRP 


7 7 


24 . 837 


16 7 0 7 


9 7 ft ft A 
Z / . O 0 ft 


1 n n 


7 Q Q Q 
3 27 . y O 


533 


c 


TRP 


77 


22 .013 


2 9 7 97 
^> . 3 27 3 


9c 9 O Q 

Z 3 . Z U 27 


i on 
1 . uu 


3 0.00 


534 


o 


TRP 


77 


21.617 


2 3 . 545 


7 7 7 
z 3 • j / 3 


1 0 0 

X . VJ u 


7 R 17 
3 O . X 3 


535 


N 


ARG 


7 8 


21.314 


21 .473 


9 4 R R 0 
z ft . 3 3 vj 


1 00 
x . u u 


7 Q 1 R 
3 27 . X O 


536 


CA 


ARG 


7 8 


20 000 


21.779 


9 4 0 0 6 
z f± . \j \j 0 


i on 
x . u u 


An no 

ft U . U Z 


537 


CB 


ARG 


78 


19.468 


2 0 . 624 


2 3 . 15 9 


i on 
x . vj u 


An R 7 

ft U . 3 / 


538 


CG 


ARG 


78 


20.303 


2 0.287 


2 1 . 94 8 


1 0 0 
X . u u 


AT R O 
ft X . 3 Z 


539 


CD 


ARG 


78 


19 . 546 


19,358 


21.017 


1 on 

X . VJ \J 


AO £ O 
ft z . 0 z 


540 


NE 


ARG 


7 8 


2 0.367 


18 . 942 


1 9 ftftA 

X -7 . O O ft 


i on 
x . u u 


A 7 R n 
f± 3 . 3 U 


541 


cz 


ARG 


78 


19.899 


18.314 


18 . 810 


1 O O 

X . VJ \J 


A A 17 
ft ft . X / 


542 


NH1 


ARG 


78 


18.60 9 


18 . 027 


1ft 716 
x 0 . / x 0 


t no 
x . u u 


A A A R 
ft ft . ft 3 


543 


NH2 


ARG 


78 


2 0.725 


17 . 971 


17 . 83 0 


1 on 
X . u u 


A A A Q 
ft ft . ft 27 


544 


c 


ARG 


7 8 


19 . 072 


21. 97 8 


9 R 1 Q A 

Z 3 . X 27 ft 


i on 


An 7 R 

ft U . 3 3 


545 


o 


ARG 


78 


19.370 


21 . 521 


9 6 7 0 0 

Z O . J U U 


i on 
x . u u 


An 1 R 

ft U . X 3 


546 


N 


GLN 


79 


17 . 950 


22 . 654 


24 . 972 


1 . 00 


40.81 


547 


CA 


GLN 


79 


16.994 


22 . 880 


26.049 


1 . 00 


41.57 


548 


CB 


GLN 


79 


15 . 840 


23 . 763 


25 . 561 


1 . 00 


42 .48 


549 


CG 


GLN 


79 


14 . 838 


24 . 149 


26 . 645 


1 . 00 


44 .22 


550 


CD 


GLN 


79 


15 .473 


24 . 908 


27 . 798 


1. 00 


45 . 14 


551 


OE1 


GLN 


79 


14 . 780 


25 . 377 


28 . 700 


1 . 00 


46.06 


552 


NE2 


GLN 


79 


16.796 


25 . 030 


27 . 776 


1 . 00 


45 . 97 


553 


C 


GLN 


79 


16 .480 


21.509 


26 . 476 


1. 00 


41.41 


554 


0 


GLN 


79 


16 .296 


20 . 621 


25 . 641 


1 . 00 


41 . 16 




24 



555 


N 


GLU 


80 


16.2 82 


21.317 


2 7.774 


1.00 


41.26 


556 


CA 


GLU 


80 


15.805 


2 0.028 


2 8 . 242 


1.00 


41.34 


557 


CB 


GLU 


8 0 


16.360 


19.704 


2 9.629 


1.00 


42 . 17 


558 


CG 


GLU 


80 


17 . 053 


18.345 


29 . 680 


1.00 


43.75 


559 


CD 


GLU 


80 


16.531 


17.3 88 


2 8 . 614 


1.00 


44 . 60 


560 


OE1 


GLU 


80 


16.83 9 


17.5 86 


2 7.415 


1.00 


4 5.22 


561 


OE2 


GLU 


8 0 


15.8 04 


16.44 0 


2 8 . 974 


1.00 


4 5.32 


562 


c 


GLU 


8 0 


14.2 86 


19 . 93 3 


2 8 . 252 


1.00 


4 0.71 


563 


o 


GLU 


80 


13 . 594 


2 0.903 


2 8 . 556 


1.00 


4 0.89 


564 


N 


LYS 


8 1 


13 .776 


18.753 


2 7 . 907 


1.00 


3 9.75 


565 


CA 


LYS 


8 1 


12 .33 5 


18 . 527 


9 7 Q f. 7 


i no 


7ft 7 7 


566 


CB 


LYS 


8 1 


12 .03 8 


17 . 092 


97 a 9 9 


i 0 n 


TO C O 
J J . J J 


567 


CG 


LYS 


81 


12.52 8 


16.771 


2 6.017 


i on 




568 


CD 


LYS 


8 1 


12.151 


15 . 3 63 


9 q S Q ft 


1 00 


1 X . 3 D 


56 9 


CE 


LYS 


8 1 


12 .619 


15 066 


9 A 1 ft 7 
^ *± . X O J 


1 nn 


AO Oft 


570 


NZ 


LYS 


8 1 


12 .261 


13 . 6 72 


2 3 . 7 98 


i on 


47 no 


571 


c 


LYS 


81 


11.663 


18 . 812 


2 9.202 


1 00 

X . KJ \J 


7 7 A 1 

-3 / . rt X 


572 


o 


LYS 


8 1 


10.62 0 


19.468 


2 9.253 


1.00 


7 7 R7 


573 


N 


ILE 


82 


12.2 56 


18 .318 


3 0.283 


1.00 


3 5.76 


574 


CA 


ILE 


82 


11.7 04 


18.52 6 


3 1 . 618 


1.00 


3 3.89 


575 


CB 


ILE 


82 


11 .863 


17 .2 57 


32 .4 84 


1.00 


~KA 1 Q 


576 


CG2 


ILE 


82 


11.265 


17.4 93 


3 3.870 


1.00 


34.32 


577 


CGI 


ILE 


82 


11 . 204 


16 . 059 


31 . 789 


1.00 


34.07 


578 


CD1 


ILE 


82 


9.717 


16.249 


31.495 


1.00 


34 . 19 


579 


c 


ILE 


82 


12 .42 8 


19.678 


32.3 00 


1.00 


3 2.44 


580 


o 


ILE 


82 


13.619 


19 . 5 82 


32.5 83 


1.00 


32.31 


581 


N 


THR 


83 


11.711 


20.765 


32 .564 


1.00 


3 0.82 


582 


CA 


THR 


83 


12 .324 


21.923 


3 3 .208 


1.00 


2 9 .02 


583 


CB 


THR 


83 


12 .2 86 


2 3.164 


32.289 


1.00 


2 9.78 


584 


OG1 


THR 


83 


10.924 


2 3 .52 0 


32 . 02 7 


1.00 


3 0.24 


585 


CG2 


THR 


83 


12.993 


22 . 875 


3 0 . 971 


1 on 


9 Q Q7 


586 


C 


THR 


83 


11.699 


22.313 


34 . 547 


1.00 


2 7.39 


587 


o 


THR 


83 


12 .239 


23 . 155 


3 5 .2 61 


1 no 


97 n Q 


588 


N 


ARG 


84 


10 . 5 72 


21 . 705 


34 . 90 0 


1 no 


^ j . z j 


589 


CA 


ARG 


84 


9 . 92 0 


22 . 04 0 


3 6 . 164 


1 no 


9 7 9 4 


590 


CB 


ARG 


84 


8.515 


21.430 


3 6.226 


i no 


9 9 9 4 


591 


CG 


ARG 


84 


8.492 


19 . 925 


3 6.443 


1.00 


21 .26 


592 


CD 


ARG 


84 


7 . 063 


19.431 


3 6 . 615 


i on 


1 Q O -3 
17 . O J 


593 


NE 


ARG 


84 


6.265 


19.571 


3 5.397 


i on 


1 q 9 ft 


594 


CZ 


ARG 


84 


6.217 


18 . 667 


34 .421 


1 on 


1 Q 

X 2? . J O 


595 


NH1 


ARG 


84 


5.458 


18,882 


3 3,354 


1.00 


2 0 . 14 


596 


NH2 


ARG 


84 


6.920 


17.54 6 


34 . 511 


1.00 


1ft ft ft 
x 0 . 0 0 


597 


c 


ARG 


84 


10.726 


21.568 


3 7 . 3 7 3 


1.00 


22.25 


598 


0 


ARG 


84 


11.52 0 


2 0 62 8 


37.2 84 


1.00 


22 . 14 


599 


N 


THR 


85 


10.507 


22 .223 


3 8.509 


1.00 


21.24 


600 


CA 


THR 


85 


11 . 199 


21 . 872 


3 9.745 


1.00 


2 0.32 


601 


CB 


THR 


85 


11.2 66 


23.065 


40.714 


1.00 


2 0.06 


602 


OG1 


THR 


85 


9.938 


23 .413 


41 . 132 


1 . 00 


18.89 


603 


CG2 


THR 


85 


11.923 


24 .273 


40 . 040 


1.00 


19.65 


604 


C 


THR 


85 


10 .447 


20 . 755 


40 .454 


1 . 00 


20 . 32 


605 


0 


THR 


85 


9.307 


20 .440 


40 . 097 


1.00 


20 . 01 


606 


N 


LYS 


86 


11 .088 


20 . 166 


41.460 


1. 00 


20 . 00 


607 


CA 


LYS 


86 


10 .479 


19.096 


42 .240 


1.00 


20.26 


608 


CB 


LYS 


86 


11.461 


18 . 588 


43 .302 


1.00 


20.47 


609 


CG 


LYS 


86 


10 . 952 


17 .395 


44 . 112 


1.00 


20 . 87 


610 


CD 


LYS 


86 


11 .883 


17 . 061 


45 .283 


1. 00 


21 . 04 




25 



611 


CE 


LYS 


8 6 


13 .251 


16 .587 


44 . 8 05 


1 . 00 


91 0 7 


612 


NZ 


LYS 


8 6 


14 . 14 9 


16 .24 6 


*T J * -7D J 


i nn 

X . VJ VJ 


9 n n n 

Z VJ . VJ VJ 


613 


c 


LYS 


86 


9.228 


1 9 . 644 


4. 9 q9 R 

*± Z • 27 Z 3 


i nn 

X . VJ VJ 


9 n 74 

.Z VJ . J ft 


614 


o 


LYS 


86 


8.204 


18.970 


4 3 . 009 


i on 

X . VJ VJ 


1Q OR 
X -7 . O 3 


615 


N 


GLU 


87 


9.328 


2 0.877 


4 3 . 412 


1.00 


21.05 


616 


CA 


GLU 


87 


8.218 


21.540 


44 . 095 


1.00 


21.59 


617 


CB 


GLU 


8 7 


8 . 662 


2 2 .931 


4 4 R c q 

*± *± . J -J -7 


1 nn 

X * VJ VJ 


9 *3 7 7 

Zj . / J 


618 


CG 


GLU 


8 7 


8.242 


23 .308 


4 5 . 97 0 


1 0 0 

X . U VJ 


9 7 7 R 
z / . / j 


619 


CD 


GLU 


87 


8 . 596 


24.74 8 


4 ^17 
*± 0 • J x 0 


1 0 0 
x . u vj 


Z 27 . O O 


620 


OE1 


GLU 


87 


8 . 108 


25.662 


4 5 613 


1 on 

X . VJ VJ 


"7 1 in 

J X . X VJ 


621 


OE2 


GLU 


87 


9.363 


24 .96 9 


4 7 9 7 7 


1 n n 

X . VJ u 


"5 1 7 7 
J X . J / 


622 


Q 


GLU 


87 


7.013 


21 .668 


4 0 1 R R 


1 n n 
X . u u 


90 £ ^ 


623 


o 


GLU 


87 


5.871 


21 .408 


4 1 R4 R 


1 0 0 

X • VJ VJ 


9 0 R fl 

Z U . 3 O 


624 


N 


GLU 


8 8 


7.273 


22 . 073 


*± X . -7 Z VJ 


1 on 

X . VJ VJ 


1 q q n 

X -7 . -7 VJ 


625 


CA 


GLU 


8 8 


6.207 


2 2 .236 


4 0 q7 R 


1 n n 

X . VJ VJ 


1 q R 7 

X -7 . O J 


626 


CB 


GLU 


88 


6.751 


2 2 .93 6 


7 q fl R 


i n n 

X . VJ VJ 


9 n 4^ 

Z VJ . ■*! J 


627 


CG 


GLU 


88 


7.345 


24 .316 


0 q q p 7 

-J J . 27 O / 


1 0 0 

X . VJ VJ 


9 9 7 ^ 
z z . j 0 


628 


CD 


GLU 


88 


7.932 


25 .004 


3 8 . 767 


1 0 0 

X • VJ VJ 


9 7 1 

Z J * X D 


629 


OE1 


GLU 


8 8 


8 . 612 


2 4 .333 


77 q^£ 

J / . 27 D D 


1 on 

X . U VJ 


9 7 4 O 

■Z J . ft 17 


630 


OE2 


GLU 


8 8 


7.728 


Z O • z o 


7 fl £ 1 fl 
.JO * OXO 


1 n n 

X . VJ u 


9 A fl Q 
Z ft . O 27 


631 


Q 


GLU 


R fl 
o o 


R £ i 7 


on h 7 7 
Z VJ . o / / 


ft U . D / 3 


i n n 


1 Q C Q 


63 2 


o 


GLU 


8 8 


4.407 


9 0 7 R 7 

Z VJ . / 3 J 


4 O "3 ^ 7 


1 n n 
x . u u 


1 7 fl R 
X / . O O 


633 


N 


ALA 


8 9 


6.466 


1 Q QCQ 
X 27 . O J O 


4 n r 9 0 


1 n n 
X . u u 


17 7 fl 
X / . / O 


634 


CA 


ALA 


8 9 


6.024 


18.501 


40 9 n n 

ft vj . z u u 


i n n 

X . U VJ 


17 7 A 
X / . j ft 


635 


CB 


ALA 


8 9 


7.236 


X / • J O U 


4 n n 9 r 
ft u . uz 0 


1 on 
1 . uu 


1 c /i -1 

X O . TT X 


63 6 


Q 


ALA 


8 9 


R 191 

~J . J- Z X 


17 Q7f| 

X / . 27 / VJ 


4 1 71 1 
ft X . J X X 


1 n n 


id . 


637 


0 


ALA 


8 9 


4 . 104 


17.32 2 


4 -l 04- R 


1 n n 

X . VJ VJ 


17 n ^ 

X / . VJ D 


638 


N 


LEU 


90 


5.496 


18 .239 


4 2.559 


1 . 00 


1 £ q R 

ID . J/J 
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57 . 657 


1. 00 


39.89 


1280 


02 P 


SEP 


174 


-4 .410 


19. 263 


55 .377 


1 . 00 


40 .27 


1281 


03P 


SEP 


174 


-6 . 331 


19 . 526 


56 .783 


1 . 00 


40 . 16 


1282 


N 


PRO 


175 


-6 . 571 


13 . 850 


54 . 079 


1 . 00 


42 .06 
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12 83 


CD 


PRO 


175 


- 6 . 744 


13 . 234 


55.405 


1 . 00 


41 . 84 


12 84 


CA 


PRO 


175 


-7.333 


13 . 122 


53 . 058 


1 . 00 


41 . 94 


12 8 5 


CB 


PRO 


175 


- 8 . 043 


12.02 9 


53 . 862 


1 . 00 


41 . 89 


12 86 


CG 


PRO 


175 


-8.070 


12 . 566 


55 .257 


1 .00 


42 . 04 


12 8 7 


Q 


PRO 


175 


-8.289 


13 . 969 


52 . 217 


1 .00 


42 . 11 


12 88 


Q 


PRO 


175 


-8.502 


13 . 678 


51.039 


1 . 00 


42 . 01 


12 8 9 


N 


SER 


176 


- 8 . 863 


15 . 011 


52 . 812 


1 . 00 


42 . 14 


12 90 


CA 


SER 


176 


-9.777 


15 . 887 


52 . 081 


1 . 00 


42 . 14 


12 91 


CB 


SER 


176 


-11 . 199 


15.310 


52 . 073 


1 .00 


42 . 12 


12 92 


OG 


SER 


176 


-11.820 


15 .434 


53 . 340 


1 .00 


41 . 96 


12 93 


Q 


SER 


176 


-9.791 


17 . 273 


52 . 709 


1 . 00 


42 .33 


1 9 


o 


SER 


176 


- 9 . 105 


17.452 


53.738 


1 . 00 


42 . 23 


1 OQC 
X ^ 3 


OT 


SER 


176 


-10.485 


18 . 159 


52 . 167 


1 . 00 


42 . 54 


1 oqc 

1Z 7D 




WAT 


2 00 


2 . 0 14 


6.388 


33.070 


1 . 00 


13 . 93 


12 97 


OH2 


WAT 


2 01 


-1.283 


-2 - 300 


28 . 997 


1.00 


15 . 35 


1 9 Q R 

X ^ _7 O 


OH2 


WAT 


2 02 


-2.992 


-1.687 


35 . 759 


1.00 


21.83 


12 9 9 


OH 2 


WAT 


2 03 


-2.513 


19.277 


39.236 


1 . 00 


14 . 45 


13 0 0 


OH 2 


WAT 


2 04 


- 6 . 092 


9 . 665 


81 . 760 


1 . 00 


17 . 79 


13 01 


OH2 


WAT 


2 05 


1.926 


17.345 


34.818 


1 . 00 


15 . 51 


13 02 


OH2 


WAT 


2 06 


-2.63 8 


. 254 


52.032 


1.00 


18 . 20 


13 0 3 


OH2 


WAT 


2 07 


11.754 


22 .376 


44 . 104 


1 . 00 


20 . 03 


13 04 


OH2 


WAT 


2 08 


9.320 


-4 . 615 


42 . 075 


1 . 00 


21.68 


13 05 


OH2 


WAT 


2 09 


4.413 


10 . 176 


28.305 


1 . 00 


24 . 86 


13 06 


OH 2 


WAT 


210 


3.500 


15 . 584 


33 . 508 


1 . 00 


17 . 17 


13 0 7 


OH 2 


WAT 


211 


3.016 


- . 603 


29 . 684 


1 .00 


16 . 24 


1308 


OH2 


WAT 


2 12 


-3 .433 


- . 744 


29 . 090 


1 .00 


20 . 34 


13 0 9 


OH2 


WAT 


213 


-3 . 928 


16 . 386 


47 . 143 


1 . 00 


19 . 86 


1310 


OH2 


WAT 


2 14 


-1 . 492 


19.796 


34.357 


1 . 00 


23 . 80 


13 1 1 


OH 2 


WAT 


215 


5.749 


17 . 029 


31 . 175 


1 . 00 


17 . 04 


1312 


OH2 


WAT 


216 


-7.589 


4 . 674 


47.250 


1 . 00 


24 . 91 


1313 


OH2 


WAT 


217 


5.836 


-2 . 876 


50 . 964 


1 .00 


22 . 06 


1314 


OH2 


WAT 


218 


11.848 


8.037 


47 . 950 


1 . 00 


23 . 46 


1315 


OH 2 


WAT 


219 


12.348 


13.3 07 


37 . 042 


1 . 00 


20.57 


1316 


OH 2 


WAT 


220 


-4.113 


9 . 169 


47.403 


1 . 00 


23 . 76 


1317 


OH2 


WAT 


221 


5.203 


22.317 


35 . 018 


1 . 00 


22 . 18 


1318 


OH2 


WAT 


222 


9.525 


6 . 324 


30 . 502 


1 .00 


26.39 


1319 


OH2 


WAT 


223 


-10.609 


9.339 


53 . 354 


1 . 00 


30 . 61 


132 0 


OH2 


WAT 


224 


-3.990 


10.3 08 


49.848 


1 . 00 


21.74 


132 1 


OH 2 


WAT 


225 


9.853 


25 . 698 


42.396 


1 . 00 


29.89 


13 2 2 


OH 2 


WAT 


226 


-6 . 397 


11.568 


75 . 148 


1 . 00 


21 . 91 


13 2 3 


OH2 


WAT 


2 2 7 


7 54 8 


3.839 


49.241 


1.00 


24 . 61 


1324 


OH2 


WAT 


22 8 


-6.083 


4 . 855 


51 . 274 


1 . 00 


23.65 


132 5 


OH2 


WAT 


22 9 


-6.550 


7.286 


50.590 


1 . 00 


23.55 


132 6 


OH 2 


WAT 


230 


. 976 


11 . 832 


28 . 657 


1 . 00 


18 . 86 


1327 


OH 2 


WAT 


231 


-5.723 


7.029 


48 . 100 


1 .00 


26.00 


1328 


OH 2 


WAT 


232 


3 . 126 


23 . 015 


39 .476 


1 .00 


21.91 


1329 


OH2 


WAT 


233 


-10 . 950 


13 . 155 


65 . 324 


1 . 00 


25.05 


1330 


OH2 


WAT 


234 


-2 .711 


12 . 584 


49 . 681 


1.00 


29.21 


1331 


OH2 


WAT 


235 


7.434 


-10 . 766 


44 .709 


1. 00 


25 . 02 


1332 


OH2 


WAT 


236 


-4 .201 


1.994 


25 . 179 


1. 00 


21 . 85 


1333 


OH2 


WAT 


237 


-2 . 892 


14 . 979 


79 . 350 


1 . 00 


29 . 32 


1334 


OH2 


WAT 


238 


14 .239 


5 .232 


43 .399 


1 . 00 


27 .32 


1335 


OH2 


WAT 


239 


-3 . 070 


-1 . 184 


54 . 587 


1 . 00 


30.22 


1336 


OH2 


WAT 


240 


-11. 964 


9.420 


40 .306 


1.00 


24 . 61 


1337 


OH2 


WAT 


241 


-2 . 664 


20 .784 


36 . 612 


1 . 00 


25 . 57 


1338 


OH2 


WAT 


242 


15 .679 


. 913 


43 . 518 


1.00 


32 . 81 




38 



1339 


0H2 


WAT 


243 


1.248 


- . 885 


54 . 817 


1.00 


24 .39 


134 0 


0H2 


WAT 


244 


-8.178 


4 . 883 


76 . 583 


1 . 00 


26 . 66 


1 Jl 1 


OH 2 


WAT 


245 


-6.075 


15.256 


48.229 


1 . 00 


27 . 33 


1342 


OH 2 


WAT 


246 


-4.529 


1 . 584 


33 . 651 


1 . 00 


26.47 


"I 4 


OH2 


WAT 


247 


-1.823 


14 . 652 


48.096 


1 . 00 


27 .21 


134 4 


OH2 


WAT 


248 


-10 . 286 


7 . 153 


39 . 164 


1 . 00 


24 .26 


1 TAC 
Ijt J 


OH2 


WAT 


24 9 


8.150 


8.218 


48.429 


1 . 00 


23 .48 


1346 


OH 2 


WAT 


250 


-8.406 


2.586 


77 . 882 


1 . 00 


25 . 81 


1347 


OH 2 


WAT 


251 


13.368 


-6 . 143 


46 . 515 


1 . 00 


38 . 51 


134 8 


OH2 


WAT 


252 


- .009 


23 . 620 


39 . 303 


1.00 


36. 19 


134 9 


OH2 


WAT 


253 


-9.460 


14 . 934 


66 . 850 


1 . 00 


34 . 00 


1350 


OH2 


WAT 


254 


-3 . 751 


20 . 245 


45 . 512 


1 . 00 


37 . 11 


1351 


OH2 


WAT 


255 


12 . 681 


10.291 


34 . 535 


1 . 00 


26.21 


13 52 


OH2 


WAT 


256 


6 . 777 


. 601 


58 .270 


1 . 00 


32 . 10 


13 53 


OH2 


WAT 


257 


8.803 


20.438 


32 . 878 


1 . 00 


28 . 86 


13 54 


OH2 


WAT 


258 


.450 


6.445 


65 . 189 


1 . 00 


30.51 


i -IRC; 

-L O J J 


OH2 


WAT 


259 


-16.448 


9.387 


39.020 


1 . 00 


41. 90 


1356 


OH2 


WAT 


260 


15 . 918 


6 . 602 


36 . 928 


1 . 00 


43 .42 


1357 


OH2 


WAT 


261 


-8 . 987 


4 . 515 


44.837 


1 . 00 


34 .46 


i ^ ^ fl 

J. J 3 O 


OH2 


WAT 


2 62 


6.985 


13 . 991 


29 . 119 


1.00 


26 . 21 


1 -ICQ 
-L J 3 ^ 


OH2 


WAT 


263 


-7 . 897 


-7.308 


39.485 


1 . 00 


38.23 


13 60 


OH2 


WAT 


264 


6.735 


8.681 


53 . 846 


1 . 00 


30 . 04 


1361 


OH 2 


WAT 


265 


-7.201 


10 . 978 


28 . 692 


1 . 00 


40 . 13 


13 62 


OH2 


WAT 


266 


-11 . 277 


11 .811 


33 . 774 


1 . 00 


34 . 24 


13 63 


OH2 


WAT 


267 


-5.570 


16.341 


27.209 


1 . 00 


27 . 61 


1 ^ fi4 


OH2 


WAT 


268 


2 8 . 978 


19 . 691 


29 . 458 


1 . 00 


34 . 54 


13 65 


OH2 


WAT 


269 


- . 394 


-3 . 182 


56 . 909 


1 . 00 


37 .28 


13 66 


OH 2 


WAT 


270 


-5.901 


-6.275 


48 . 732 


1 . 00 


38 . 98 


13 67 


OH 2 


WAT 


271 


-5 . 314 


13.550 


50 . 350 


1 . 00 


28 . 96 


13 68 


OH2 


WAT 


272 


17 . 572 


7 . 657 


45 .251 


1 . 00 


37 . 08 


13 69 


OH2 


WAT 


273 


1.725 


. 998 


24.485 


1 . 00 


41 . 72 


13 70 


OH 2 


WAT 


274 


14 . 327 


-4 .819 


37 . 245 


1 . 00 


37 . 09 


1371 


OH 2 


WAT 


275 


6.276 


11.721 


57 . 649 


1 . 00 


47 .33 


13 72 


OH2 


WAT 


276 


- 15 . 076 


17 . 669 


59 . 792 


1 . 00 


40 . 97 


13 73 


OH2 


WAT 


277 


- . 429 


-2.499 


53 . 645 


1 . 00 


33 . 17 


13 74 


OH 2 


WAT 


278 


3.038 


-3 . 057 


54 . 799 


1 . 00 


39 . 54 


13 75 


OH 2 


WAT 


279 


-3 . 954 


9 .216 


72 . 671 


1 . 00 


39. 85 


1376 


OH 2 


WAT 


280 


-10.200 


1.398 


41 . 849 


1 . 00 


42 . 94 


13 77 


OH2 


WAT 


2 81 


- 10 . 292 


3.217 


52 . 063 


1 . 00 


43 . 76 


13 78 


OH2 


WAT 


2 82 


-15.225 


12 . 071 


60 . 987 


1 . 00 


45 . 57 


13 7 3 


OH 2 


WAT 


283 


4 0 01 


-4 .280 


52 .2 74 


1.00 


2 9.89 


13 80 


OH 2 


WAT 


284 


- 8 . 055 


-8.627 


42 . 074 


1 . 00 


41 . 79 


13 81 


OH 2 


WAT 


285 


-1 . 008 


-6 . 151 


62 .404 


1 . 00 


46 . 22 


13 82 


OH2 


WAT 


286 


-9.176 


10 . 031 


51 . 259 


1 . 00 


36 . 18 


13 83 


OH2 


WAT 


287 


-3.447 


5 . 807 


27 . 025 


1. 00 


39.54 


13 84 


OH2 


WAT 


288 


6 . 178 


1 . 935 


27 . 972 


1. 00 


41.27 


13 85 


OH 2 


WAT 


289 


- . 878 


11 . 803 


51 . 809 


1 . 00 


31.35 


1386 


OH2 


WAT 


290 


4 . 340 


. 648 


60 . 998 


1.00 


32 . 63 


1387 


OH2 


WAT 


291 


7 .811 


-2 .038 


52 . 827 


1.00 


36 . 03 


1388 


OH2 


WAT 


292 


-8 . 166 


18 .423 


67 . 710 


1 . 00 


38 . 02 


1389 


OH2 


WAT 


293 


-14 . 604 


10 .287 


35 .634 


1 . 00 


45 . 62 


1390 


OH2 


WAT 


294 


-10 .474 


20 .227 


53 . 859 


1 . 00 


46 . 74 


1391 


OH2 


WAT 


295 


17 . 892 


23 . 932 


22 .437 


1. 00 


42 . 17 


1392 


OH2 


WAT 


296 


17 . 145 


17 .710 


24 . 729 


1.00 


47 . 54 


1393 


OH 2 


WAT 


297 


1 .480 


20 . 841 


46 . 728 


1.00 


36 . 92 


1394 


OH2 


WAT 


298 


-10 .284 


6 . 137 


64 .005 


1.00 


46.42 
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1 O QC 
1 J " J 


0H2 


WAT 


2 99 


3 . 682 


3 . 530 


62 .317 


1.00 


44 . 53 


1 "5 Q & 


OH 2 


WAT 


3 00 


-2 . 017 


-8 . 476 


51 .591 


1.00 


45 .29 


i 7 Q7 
1 j 7 / 


0H2 


WAT 


3 01 


- . 106 


11 . 221 


83 . 869 


1 . 00 


37 . 74 


1 T QO 


OH 2 


WAT 


3 02 


-8 . 555 


9 .470 


27 .344 


1. 00 


50 . 52 


1 1QQ 




WAT 


3 03 


6 . 694 


27.979 


40 . 780 


1.00 


40 . 54 


i Ann 
14 u u 


wn z 


WAT 


3 04 


-9.496 


-4.420 


47 .793 


1.00 


40 . 89 


t a n i 




WAT 


3 05 


-15.396 


11 . 041 


56 . 150 


1 .00 


44 .20 


14 U Z 


riM9 
wnz 


WAT 


3 06 


-5 . 928 


-11.415 


43 .389 


1 . 00 


48 . 32 




unz 


WAT 


3 07 


-10.479 


-7 . 663 


44 . 866 


1. 00 


52 . 88 


14 04 


UnZ 


WAT 


3 0 8 


14.2 62 


4 . 179 


49.470 


1 . 00 


46 . 97 


1 A f\ C 

14 U o 


OU9 
unz 


WAT 


3 0 9 


-.041 


15 . 184 


62 .705 


1. 00 


40 . 11 


14 Ub 


Unz 


WAT 


310 


15.93 9 


3 . 184 


44 . 811 


1 .00 


44 .49 


14 U / 


UnZ 


WAT 


311 


3 . 817 


- . 929 


63 . 168 


1 . 00 


42 . 13 


14 0 8 


Uriz 


WAT 


3 12 


- 10 . 174 


- . 055 


47 . 822 


1 .00 


44 .73 


1 a n q 

14 u y 


UnZ 


WAT 


313 


13 . 924 


- . 156 


50 . 669 


1 . 00 


43 .49 


i a 1 n 
14 1 U 


unz 


WAT 


3 14 


4 . 943 


21.521 


46 . 064 


1 .00 


33 . 15 




OR 9 
unz 


WAT 


315 


-4 . 990 


11.627 


72 . 769 


1 . 00 


42 .40 


1 A 1 9 


unz 


WAT 


316 


-8.733 


15.608 


48 . 610 


1.00 


49.94 


1 yl 1 O 

14 1 -j 


unz 


WAT 


317 


-7 . 762 


17.938 


49 . 355 


1 .00 


40 .62 


TATA 
14 14 


unz 


WAT 


318 


-4.97 8 


- . 433 


31 . 726 


1 . 00 


37.41 


1 yl 1 C 

14 lo 




WAT 


319 


-4.68 9 


22.825 


55 . 686 


1.00 


47.23 


14 1 O 


AUT 

unz 


WAT 


320 


11.750 


-10 . 644 


40 . 083 


1 .00 


48 .25 


14 X / 


OH 2 


WAT 


321 


- 10 . 564 


18 . 537 


57 .762 


1.00 


37.40 


1 A 1 P 


0H2 


WAT 


322 


7.490 


10 . 973 


25 .298 


1 .00 


45 .18 


1 A 1 Q 
14 13 


OW9 
unz 


WAT 


32 3 


2.274 


12 . 065 


76 . 050 


1.00 


38 . 17 


i a 9 n 


unz 


WAT 


324 


-2 . 115 


17 . 289 


58 . 628 


1 .00 


41 . 95 


*| A 9 1 


OH 2 


WAT 


325 


16.399 


18 . 887 


20 . 072 


1.00 


48 .74 


1 A 9 9 


OH 2 


WAT 


326 


20 . 050 


19 . 008 


27 . 094 


1.00 


45.30 


1 A 9 1 
14 Z .3 


unz 


WAT 


3 27 


-10.935 


-4.259 


42 . 863 


1.00 


52 .28 


1 A 9 A 
14 Z *± 


OH 9 
unz 


WAT 


32 8 


-5.431 


-5 . 213 


36 .216 


1.00 


35.64 


1/OC 
l*iZj 


unz 


WAT 


32 9 


-9 . 928 


22 . 834 


64 . 306 


1 . 00 


45 .77 


T >1 O C 

14 z b 


unz 


WAT 
Wri X 


33 0 


- 8 .275 


6.285 


42 . 925 


1 . 00 


42 . 06 


14z / 


Unz 


WAT 


j j 1 


- 13 .04 9 


17.2 06 


57 . 668 


1 . 00 


38 . 15 


14 z o 


AUO 

unz 


WAT 
W>1 X 


3 32 


5.624 


-10 . 159 


35 . 555 


1 .00 


40 .21 


14 z y 


AU') 

Unz 


WAT 
V»n X 


3 3 3 


23 .293 


26 . 941 


32.316 


1 . 00 


48 .27 


t a "3 n 

14 J U 


ALTO 

unz 


WAT 


3 34 


1.567 


. 896 


62 .295 


1 .00 


37 .23 


1 A ^ 1 
14 O 1 


AU9 

unz 


WAT 


33 5 


-13.988 


-1.533 


56 . 782 


1 . 00 


49.02 


1 A 1 9 
14 J Z 


AUO 

unz 


WAT 


3 3 6 


6.259 


13.631 


61 .216 


1 . 00 


40 .35 


± *t J j 


OH9 
unz 


WAT 


337 


-9.939 


-2.809 


49 . 680 


1 . 00 


50 . 64 


1 A A 


unz 


WAT 


33 8 


-10.829 


10.035 


61 .256 


1 . 00 


39 . 01 


~\ A'aZL 
11 J J 


unz 


WAT 


33 9 


16 . 685 


10 . 608 


37 . 337 


1.00 


48 .68 


14 J O 


AUO 

unz 


WAT 


34 0 


-8.780 


. 857 


55 . 000 


1 . 00 


42 . 72 


1 A 1 "7 
1 4 J / 


AUO 

unz 


WAT 


341 


-8.843 


-3.223 


40 . 067 


1 . 00 


36 . 16 


14*30 
14 j 0 


unz 


WAT 


342 


. 612 


11.479 


80 . 681 


1 . 00 


45 . 15 


1439 


0H2 


WAT 


343 


1 .599 


-2 .462 


63 . 176 


1.00 


45 .28 


1440 


0H2 


WAT 


344 


18 . 905 


21.201 


43 . 743 


1 . 00 


46 . 85 


1441 


0H2 


WAT 


345 


9 . 154 


-12 . 805 


43 . 973 


1 . 00 


45 .21 


1442 


0H2 


WAT 


346 


-8.708 


18 .686 


55 . 960 


1 . 00 


41 . 01 


1443 


OH2 


WAT 


347 


-3 . 987 


-7 . 989 


48.762 


1.00 


44 .36 


1444 


0H2 


WAT 


348 


-2 . 038 


-9 . 706 


42 .264 


1.00 


40 .86 


1445 


0H2 


WAT 


349 


-11.305 


11 . 929 


49 . 565 


1 . 00 


45 . 07 


1446 


0H2 


WAT 


350 


3 .795 


3 .399 


77 . 919 


1 .00 


48 .57 


1447 


0H2 


WAT 


351 


8 . 628 


9. 813 


50.645 


1.00 


39.75 




. 40 ■ 

Note to Table 1 - Coordinates from restrained individual B- 
f actor refinement, refinement resolution: 62.017 - 1.840 A; starting 
r= .2597; free_r= .2866; final r^ .2444; free_r= .2708; B rmsd for 
bonded mainchain atoms=.84 6; target^ 1.5; B rmsd for bonded sidechain 
5 atoms= .985; target= 2.0; B rmsd for angle mainchain atoms= 1.551; 
target^ 2.0; B rmsd for angle sidechain atoms= 1.623; target= 2.5; 
wa= 1.17512; rweight= . 530173 ; target= mlf; steps= 30; space group = 
P2 1 2 1 2 1 (a= 35.270; b= 43.903 ; c= 124.659; (X=P=y=90°) ; B-COrrection 
resolution: 6.0 - 1.840; initial B-f actor correction applied to fobs: 

10 Bll= -1.655; B22= -3.389; B33= 5.044; B12= .000; B13= .000; B23= 

.000; B-f actor correction applied to coordinate array B: 2.789; bulk 
solvent: density level= .377734 e/A 3 ; B-factor= 30.924 A 2 reflections 
with | Fobs | /sigma_F < 0.0 rejected reflections with | Fobs | > 10000 * 
rms(Fobs) rejected theoretical total number of reflections in 

15 resolution range: 17526 (100.0 %) number of unobserved reflections 
. (no entry or |f|=0) : 419 (2.4 %) number of reflections rejected: 0 
(0.0 %) total number of reflections used: 17107 (97.6 %) number of 
reflections in working set: 16241 (92.7 %) ; number of reflections in 
test set: 866 (4.9 %) CRYST1 35.270 43.903 124.659 90.00 90.00 

20 90.00 P 21 21 21. 



The human Pinl numbering sequence differs from that above due to the 
addition of four residues at the N-terminus from the E.coli cloning/expression vector. 
Thus, one should add 4 to the native human Pinl sequence to arrive at the numbering 
25 scheme above. For instance, Trpl 1 of the original sequence is now Tip 15 above. 



In addition, in accordance with this invention, WW domains, Pinl WW domains, 
or Pinl WW domain mutants may be crystallized in co-complex with known WW 
domain binding agents, substrates, or inhibitors. The crystal structures of a series of such 
30 complexes may then be solved by molecular replacement and compared with that of a 
wild-type WW domain. Potential sites for modification within the various binding sites 
of the WW domain may thus be identified. This information provides an additional tool 
for determining the most efficient binding interactions, for example, increased 
hydrophobic interactions, between a WW domain and a chemical entity or compound. 

35 All of the complexes referred to above may be studied using well-known X-ray 

diffraction techniques and may be refined versus 2-3 A resolution X-ray data to an R 
value of about 0.20 or less using computer software, such as X-PLOR (Yale University, 
1992, distributed by Molecular Simulations, Inc.). See, e.g., Blundel & Johnson, supra; 
Methods in Enzymology, vol. 114 and 1 15, H. W. Wyckoff et al. y eds., Academic Press 

40 (1985). This information may thus be used to optimize known classes of WW domain 
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binding agents (e.g., inhibitors), and to design and synthesize novel classes of WW 
domain binding agents (e.g., inhibitors). 

The design of binding agents that bind or otherwise associate with or inhibit a 
WW domain according to the invention generally involves consideration of two factors. 
5 First, the compound or binding agent must be capable of physically and structurally 
associating with a WW domain. Non-covalent molecular interactions important in the 
association of a WW domain with a substrate include hydrogen bonding, van der Waals 
and hydrophobic interactions. 

Second, the compound or binding agent must be able to assume a conformation 
10 that allows it to associate with a WW domain. Although certain portions of the 
compound or binding agent will not directly participate in this association, those portions 
may still influence the overall conformation of the molecule. This, in turn, may have a 
significant impact on potency. Such conformational requirements include the overall 
three-dimensional structure and orientation of the chemical entity or compound in 
15 relation to all or a portion of the binding site, e.g., active site or accessory binding site of 
a WW domain (e.g., a Pinl WW domain), or the spacing between functional groups of a 
compound comprising several chemical entities that directly interact with a WW 
domain. 

The potential inhibitory or binding effect of a chemical compound on a WW 
20 domain may be analyzed prior to its actual synthesis and testing by the use of computer 
modeling techniques. If the theoretical structure of the given compound suggests 
insufficient interaction and association between it and a WW domain, synthesis and 
testing of the compound may be obviated. However, if computer modeling indicates a 
strong interaction, the molecule may then be tested for its ability to bind to a WW 
25 domain. Methods of assaying for WW domain activity are known in the art. Methods 
for assaying the effect of a potential binding agent can be performed in the presence of a 
known binding agent of a WW domain. For example, the effect of the potential binding 
agent can be assayed by measuring the ability of the potential binding agent to compete 
with a known binding agent. 
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An inhibitory or other binding compound of a WW domain may be 
computationally evaluated and designed by means of a series of steps in which chemical 
entities or fragments are screened and selected for their ability to associate with the 
individual binding pockets or other areas of a WW domain. 

5 One skilled in the art may use one of several methods to screen chemical entities 

or fragments for their ability to associate with a WW domain and more particularly with 
the individual binding pockets of the WW domain of Pinl. This process may begin by 
visual inspection of, for example, the active site on the computer screen based on the 
Pinl WW domain coordinates in Table 1. Selected fragments or chemical entities may 
10 then be positioned in a variety of orientations, or docked, within an individual binding 
pocket of a WW domain. Docking may be accomplished using software such as Quanta 
and Sybyl, followed by energy minimization and molecular dynamics with standard 
molecular mechanics forcefields, such as CHARMM and AMBER. 

Specialized computer programs may also assist in the process of selecting 
1 5 fragments or chemical entities. These include: 

1. GRID (Goodford, P. J., "A Computational Procedure for Determining 
Energetically Favorable Binding Sites on Biologically Important Macromolecules", J. 
Med. Chem., 28, pp. 849-857 (1985)). GRID is available from Oxford University, 
Oxford, UK. 

20 2. MCSS (Miranker, A. and M. Karplus, "Functionality Maps of Binding Sites: 

A Multiple Copy Simultaneous Search Method." Proteins: Structure. Function and 
Genetics, 11, pp. 29-34 (1991)). MCSS is available from Molecular Simulations, 
Burlington, Mass. 

3. AUTODOCK (Goodsell, D. S. and A. J. Olsen, "Automated Docking of 
25 Substrates to Proteins by Simulated Annealing", Proteins: Structure. Function, and 
Genetics, 8, pp. 195-202 (1990)). AUTODOCK is available from Scripps Research 
Institute, La Jolla, Calif. 
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4. DOCK (Kuntz, I. D. et ai, "A Geometric Approach to Macromolecule-Ligand 
Interactions", J. Mol. Biol., 161, pp. 269-288 (1982)). DOCK is available from 
University of California, San Francisco, Calif. 

Once suitable chemical entities or fragments have been selected, they can be 
5 assembled into a single compound or binding agent {e.g., an inhibitor). Assembly may 
be performed by visual inspection of the relationship of the fragments to each other on 
the three-dimensional image displayed on a computer screen in relation to the structure 
coordinates of the Pinl WW domain as set forth in Table 1. This would be followed by 
manual model building using software such as Quanta or Sybyl. 

10 Useful programs to aid one of skill in the art in connecting the individual 

chemical entities or fragments include: 

1. CAVEAT (Bartlett, P. A. et al, "CAVEAT: A Program to Facilitate the 
Structure-Derived Design of Biologically Active Molecules". In "Molecular Recognition 
in Chemical and Biological Problems", Special Pub., Royal Chem. Soc, 78, pp. 182-196 

15 (1989)). CAVEAT is available from the University of California, Berkeley, Calif. 

2. 3D Database systems such as MACCS-3D (MDL Information Systems, San 
Leandro, Calif.). This area is reviewed in Martin, Y. C, "3D Database Searching in 
Drug Design", J. Med. Chem., 35, pp. 2145-2154 (1992)). 

3. HOOK (available from Molecular Simulations, Burlington, Mass.). 

20 In addition to the method of building or identifmg a WW domain binding agent 

in a step-wise fashion one fragment or chemical entity at a time as described above, 
inhibitory or other WW domain binding compounds may be designed as a whole or "de 
novo" using either an empty active site or optionally including some portion(s) of a 
known inhibitor(s). These methods include: 

25 1 . LUDI (Bohm, H.-J., "The Computer Program LUDI: A New Method for the 

De Novo Design of Enzyme Inhibitors", J. Comp. Aid. Molec. Design, 6, pp. 61-78 
(1992)). LUDI is available from Biosym Technologies, San Diego, Calif. 
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2. LEGEND (Nishibata, Y. and A. Itai, Tetrahedron, 47, p. 8985 (1991)). 
LEGEND is available from Molecular Simulations, Burlington, Mass. 

3. LeapFrog (available from Tripos Associates, St. Louis, Mo.). 

Other molecular modeling techniques may also be employed in accordance with 
5 this invention. See, e.g., Cohen, N. C. et al. 9 "Molecular Modeling Software and 
Methods for Medicinal Chemistry", J. Med. Chem., 33, pp. 883-894 (1990). See also, 
Navia, M. A. and M. A. Murcko, "The Use of Structural Information in Drug Design", 
Current Opinions in Structural Biology, 2, pp. 202-210 (1992). 

Once a compound or binding agent has been designed or selected by the above 
10 methods, the efficiency with which that compound may bind to a WW domain may be 
tested and optimized by computational evaluation. 

A compound designed or selected as a WW domain binding agent may be 
further computationally optimized so that in its bound state it would preferably lack 
repulsive electrostatic interaction with the target site. Such non-complementary (e.g., 
15 electrostatic) interactions include repulsive charge-charge, dipole-dipole and charge- 
dipole interactions. Specifically, the sum of all electrostatic interactions between the 
binding agent and the WW domain when the binding agent is bound to the WW domain, 
preferably make a neutral or favorable contribution to the enthalpy of binding. 

Specific computer software is available in the art to evaluate compound 
20 deformation energy and electrostatic interaction. Examples of programs designed for 
such uses include: Gaussian 92, revision C (M. J. Frisch, Gaussian, Inc., Pittsburgh, Pa., 
1992); AMBER, version 4.0 (P. A. Kollman, University of California at San Francisco, 
1994); QUANTA/CHARMM (Molecular Simulations, Inc., Burlington, Mass. 1994); 
and Insight II/Discover (Biosysm Technologies Inc., San Diego, Calif, 1994). These 
25 programs may be implemented, for example, using a Silicon Graphics workstation, IRIS 
4D/35 or IBM RISC/6000 workstation model 550. Other hardware systems and software 
packages will be known to those skilled in the art of which the speed and capacity are 
continually modified 
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Once a WW domain binding agent has been selected or designed, as described 
above, substitutions may then be made in some of its atoms or side groups in order to 
improve or modify its binding properties. Generally, initial substitutions are 
conservative, e.g., the replacement group will have approximately the same size, shape, 
5 hydrophobicity and charge as the original group. Such substituted chemical compounds 
may then be analyzed for efficiency of fit to a WW domain by the same computer 
methods described above. 

A further aspect of the invention encompasses methods of treatment using 
inhibitors or binding agents of a WW domain. The WW domain activity has been 
10 associated with a number of diseases including, for example, hypertension and 
muscular dystrophy (Staub and Rotin, Structure 4(5):495-9, 1996); and breast cancer 
(Bednarak et al., Cancer Res., 60(8):2140-5, 2000). 

Cell proliferative disorders contemplated for treatment using the invention 
compounds and methods disclosed herein include disorders characterized by 
15 unwanted, inappropriate or uncontrolled cell growth. Particular examples include 
cancer, fibrotic disorders, non-neoplastic growths such as benign prostatic 
hypertrophy, endometriosis, psoriasis, and the like. Cancers contemplated for 
treatment in accordance with the present invention include both solid tumors and 
hematopoeitic cancers such as leukemias and lymphomas. 

20 Solid tumors that are treatable utilizing the invention compounds and methods 

include carcinomas, sarcomas, osteomas, fibrosarcomas, chondrosarcomas, and the 
like. Specific cancers contemplated for treatment include breast cancer, brain cancer, 
lung cancer (non-small cell and small cell), colon cancer, pancreatic cancer, prostate 
cancer, gastric cancer, bladder cancer, kidney cancer, head and neck cancer, and the 

25 like. 

Fibrotic disorders are generally characterized by inappropriate 
overproliferation of non-cancerous fibroblasts. Examples include fibromyalgia, 
fibrosis (cystic, hepatic, idopathic pulmonary, pericardial, and the like), cardiac 
fibromas, fibromuscular hyperplasia, restenosis, atherosclerosis, fibromyositis, and 
30 the like. 
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Invention compounds are additionally useful in inhibiting mitosis in 
pathogenic organisms and are, therefore, useful for treating infectious diseases. 
Particular infectious diseases treatable by the methods disclosed herein include 
bacterial infections and fungal infections. 

5 Bacterial infections contemplated for treatment using invention compounds 

and methods include infections caused by both gram-positive and gram-negative 
bacteria, including infections caused by Staphylococcus, Clostridium, Streptococcus, 
Enterococcus, Diplococcus, Hemophilus, Neisseria, Erysipelothricosis, Listeria, 
Bacillus, Salmonella, Shigella, Escherichia, Klebsiella, Enter obacter, Serratia, 
10 Proteus, Morganella, Providencia, Yersinia, Camphylobacter, Mycobacteria, and the 
like. Infection by such organisms causes a wide variety of disorders including 
pneumonia, diarrhea and dysentery, anthrax, rheumatic fever, toxic shock syndrome, 
mastoiditis, meningitis, gonorrhea, typhoid fever, gastroenteritis, brucellosis, cholera, 
bubonic plague, tetanus, tuberculosis, Lyme disease, and the like. 

15 Fungal infections contemplated for treatment using invention compounds and 

methods include systemic fungal infections, dermatophytoses and fungal infections of 
the genito-unrinary tract. Systemic fungal infections include those caused by 
Histoplasma, Coccidioides, Cryptococcus, Blastocyces, Paracoccidioides, Candida, 
Aspergillus, Nocardia, Sporothrix, Rhizopus, Absidia, Mucor, Hormodendrum, 

20 Phialophora, Rhinosporidium, and the like. Dermatophyte infections include those 

caused by Microsporum, Trichophyton, Epidermophyton, Candida, Pityrosporum, and 
the like. Fungal disorders of the genito-urinary tract include infections caused by 
Candida, Cryptococcus, Aspergillus, Zygomycodoides, and the like. Infection by such 
organisms causes a wide variety of disorders such as ringworm, thrush, San Joaquin 

25 fever or Valley fever, Gilcrist's disease, and the like. These infections can be 

particularly serious, and even fatal, in patients with a depressed immune system such 
as organ transplant recipients and persons with acquired immunodefficiency 
syndrome (AIDS). 

In a further aspect of the invention, invention compounds may be used as 
30 insecticides. The compounds of the invention prevent mitosis in insect cells, and thus 
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can be used to control the growth and proliferation of a variety of insect pests. This 
aspect of the invention has important applications in agriculture, such as in the field, 
in the storage of agricultural products, and the like. Additionally, invention 
compounds are useful for controlling insect populations in places inhabited by man, 
5 such as homes, offices, and the like. 

The particular invention compound(s) selected for therapeutic use as taught 
herein can be administered to a subject either alone or in a pharmaceutical 
composition where the compound(s) is mixed with suitable carriers or excipient(s). In 
treating a subject, a therapeutically effective dose of compound (i.e., active 
10 ingredient) is administered. A therapeutically effective dose refers to that amount of 
the active ingredient that produces amelioration of symptoms or a prolongation of 
survival of a subject. 

Toxicity and therapeutic efficacy of a compound can be determined by 
standard pharmaceutical procedures in cell culture or experimental animals. Cell 

1 5 culture assays and animal studies can be used to determine the LD 5 o (the dose lethal to 
50% of a population) and the ED 50 (the dose therapeutically effective in 50% of a 
population). The dose ratio between toxic and therapeutic effects is the therapeutic 
index, which can be expressed as the ratio LD 5 o/ED 50 . Compounds which exhibit 
large therapeutic indices are preferred. The data obtained from these cell culture 

20 assays and animal studies can be used in formulating a range of dosages suitable for 
use in humans. The dosage of such compounds lies preferably within a range of 
circulating concentrations that include the ED50 with little or no toxicity. The dosage 
may vary within this range depending upon a variety of factors, e.g., the dosage form 
employed, the route of administration utilized, the condition of the subject, and the 

25 like. 

For any compound used in the method of the invention, the therapeutically 
effective dose can be estimated initially from cell culture assays by determining an 
IC 50 (i.e., the concentration of the test substance which achieves a half-maximal 
inhibition of PPIase activity). A dose can then be formulated in animal models to 
30 achieve a circulating plasma concentration range that includes the IC50 as determined 
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in cell culture. Such information can be used to more accurately determine useful 
doses in humans. Levels in plasma may be measured, for example, by HPLC. The 
exact formulation, route of administration and dosage can be chosen by the individual 
physician in view of the patient's condition. (See e.g. Fingl et ai, 1975, in "The 
5 Pharmacological Basis of Therapeutics", Ch. 1 pi). 

It should be noted that the attending physician would know how to and when 
to terminate, interrupt, or adjust administration due to toxicity, to organ dysfunction, 
and the like. Conversely, the attending physician would also know to adjust treatment 
to higher levels if the clinical response were not adequate (precluding toxicity). The 
1 0 magnitude of an administered dose in the management of the disorder of interest will 
vary with the severity of the condition to be treated, with the route of administration, 
and the like. The severity of the condition may, for example, be evaluated, in part, by 
standard prognostic evaluation methods. Further, the dose and perhaps dose 
frequency will also vary according to the age, body weight, and response of the 
1 5 individual patient. Typically, the dose will be between about 1-10 mg/kg of body 
weight. About 1 mg to about 50 mg will be administered to a child, and between 
about 25 mg and about 1000 mg will be administered to an adult. A program 
jmparable to that discussed above may be used in veterinary medicine. 



cor 



Depending on the specific conditions being treated, such agents may be 
20 formulated and administered systemically or locally. Techniques for formulation and 
administration may be found in "Remington's Pharmaceutical Sciences," 1990, 18th 
ed., Mack Publishing Co., Easton, PA. Suitable routes may include oral, rectal, 
transdermal, vaginal, transmucosal, or intestinal administration; parenteral delivery, 
including intramuscular, subcutaneous, intramedullary injections, as well as 
25 intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal, or 
intraocular injections, just to name a few. 



For injection, compounds of the invention may be formulated in aqueous 
solutions, preferably in physiologically compatible buffers such as Hank's solution 
Ringer's solution, or physiological saline buffer. For transmucosal administration, 
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penetrants appropriate to the barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art. 

Use of pharmaceutically acceptable carriers to formulate the compounds 
herein disclosed into dosages suitable for systemic administration is within the scope 
5 of the invention. With proper choice of carrier and suitable manufacturing practice, 
the compositions of the present invention, in particular those formulated as solutions, 
may be administered parenterally, such as by intravenous injection. The compounds 
can be readily formulated using pharmaceutically acceptable carriers well known in 
the art into dosages suitable for oral administration. Such carriers enable the 
10 compounds of the invention to be formulated as tablets, pills, capsules, dragees, 

liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a subject 
to be treated. 

Agents intended to be administered intracellular^ may be administered using 
techniques well known to those of ordinary skill in the art. For example, such agents 

15 may be encapsulated into liposomes, then administered as described above. 

Liposomes are spherical lipid bilayers with aqueous interiors. All molecules present 
in an aqueous solution at the time of liposome formation are incorporated into the 
aqueous interior. The liposomal contents are both protected from the external 
microenvironment and, because liposomes fuse with cell membranes, are efficiently 

20 delivered into the cell cytoplasm. Delivery systems involving liposomes are 

discussed in International Patent Publication No. WO 91/02805 and International 
Patent Publication ino. WU y i/ i^jui, a.s> wen <*& u.o. ± aiem i>u. t,ooi;,ojj ^ •""^^ 
et al These publications and patents provide useful descriptions of techniques for 
liposome drug delivery and are incorporated by reference herein in their entirety. 

25 Pharmaceutical compositions contemplated for use in the present invention 

include compositions wherein the active ingredients are contained in an amount 
effective to achieve the intended purpose. Determination of an effective amount is 
well within the capability of those skilled in the art, especially in light of the detailed 
disclosure provided herein. 
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In addition to the active ingredients, these pharmaceutical compositions may 
contain suitable pharmaceutical^ acceptable excipients and auxiliaries which 
facilitate processing of the active compounds into preparations which can be used 
pharmaceutical^. 



5 The pharmaceutical compositions of the present invention may be 

manufactured in a manner that is itself known, e.g., by means of conventional mixing, 
dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, 
entrapping, lyophilizing processes, or the like. 

Pharmaceutical formulations for parenteral administration include aqueous 
10 solutions of the active compounds in water-soluble form. Additionally, suspensions 

of the active compounds may be prepared as appropriate oily injection suspensions. 

Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or 

synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. 

Aqueous injection suspensions may contain compounds which increase the viscosity 
15 of the suspension, such as sodium carboxymethyl cellulose, sorbitol, dextran, or the 

like. Optionally, the suspension may also contain suitable stabilizers or agents which 

increase the solubility of the compounds to allow for the preparation of highly 

concentrated solutions. 



Pharmaceutical preparations for oral use can be obtained by combining the 
20 active compounds with solid excipient, optionally grinding the resulting mixture, and 
processing the mixture of granules, after adding suitable auxiliaries, if desired, to 
obtain tablets or dragee cores. 

Suitable excipients are, in particular, fillers such as sugars, including lactose, 
sucrose, mannitol, sorbitol, and the like; cellulose preparations such as, for example, 
25 maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl 
cellulose, hydroxypropylmethylcellulose, sodium carboxymethylcellulose, 
polyvinylpyrrolidone (PVP), and the like, as well as mixtures of any two or more 
thereof. If desired, disintegrating agents may be added, such as cross-linked 




polyvinyl pyrrolidone, agar, alginic acid or a salt thereof such as sodium alginate, and 
the like. 

Dragee cores are provided with suitable coatings. For this purpose, 
concentrated sugar solutions may be used, which may optionally contain gum arabic, 
5 talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, titanium dioxide, 
lacquer solutions, suitable organic solvents or solvent mixtures, and the like. 
Dyestuffs or pigments may be added to the tablets or dragee coatings for 
identification or to characterize different combinations of active compound doses. 

Pharmaceutical preparations which can be used orally include push- fit 
10 capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a 

plasticizer, such as glycerol or sorbitol. The push-fit capsules can contain the active 
ingredients in admixture with filler such as lactose, binders such as starches, and/or 
lubricants such as talc or magnesium stearate and, optionally, stabilizers. In soft 
capsules, the active compounds may be dissolved or suspended in suitable liquids, 
15 such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 
stabilizers may be added. 

The invention will now be described in greater detail by reference to the 
following non-limiting examples. 

EXAMPLES 

20 Protein purification and crystallography. His8-tagged Pinl was expressed and 

purified by Ni +2 -chelation chromatography, the histidine tag was removed by thrombin 
digestion, and the protein purified as described in Ranganathan et al 9 Cell 82:875-886, 
1997. All site-directed mutants were constructed using the QuikChange (Stratagene) 
protocol and purified like wild type Pinl . Crystals of the Pinl-YpSPTpSPS complex 

25 were grown in hanging drops at 4 °C by mixing 1 .0 jal of the Pinl -peptide complex with 
1.0 jil of a reservoir solution containing 100 mM MOPSO-Na + (pH 7.0), 28% (v/v) PEG 
8000, 2 mM DTT, stabilized in 100 mM MOPSO-Na + (pH 7.0), 25% (v/v) PEG 8000, 
20% (v/v) glycerol, 1 mM DTT, and rapidly frozen in a 100 K stream of nitrogen gas. A 
native data set extending to 1.84 A resolution was collected at the Stanford Synchrotron 
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Radiation Laboratory, beamline 9-1 (k = 0.98 A). Data were processed with DENZO 
and scaled with SCALEPACK (Otwinowski, Z., Minor, W. Meth. Enzymol. 276:307- 
326, 1997). The crystals contain one Pinl-peptide complex per asymmetric unit and 
belong to space group P2i2i2, (a = 35.27 A, b = 43.90 A, c = 124.66 A, a = P = y = 90°). 
5 The final data set with an overall R merge of 6.2 % (R merge = 33.7 % in the highest 
resolution shell) is 97.3 % complete (98.2 % in the highest resolution shell) for the 
resolution range of 62.02 A to 1 .84 A. The structure of the Pinl-YpSPTpSPS repeat 
complex was solved by molecular replacement with AMORE (Navaza, J. Acta 
Crystallogr. A 50:157-163, 1994) using the Pinl monomer as a starting model 

10 (Ranganathan et al 9 Cell 89:875-886, 1997). The resulting model was then positionally 
refined against all the data between 62.02 A and 1 .84 A using the default bulk solvent 
model in CNS with maximum likelihood targets (Brunger et al, Acta Crystallogr. D 
54:905-921, 1998). After refining the rebuilt model, water molecules were added 
automatically using CNS and edited manually in O. The final positional refinement 

15 converged to a crystallographic R factor of 24% (R free = 27%) for all data (17107 

reflections) between 62.02 A and 1.84 A. This model consists of residues 1 to 38 and 51 
to 163 of human Pinl, the entire phosphopeptide, and 151 water molecules. Model 
quality was checked with PROCHECK (Laskowski et aL, J. Appl. Crystallogr. 26:283- 
291, 1993). A total of 91.7% of the residues are in the most favored regions of the 

20 Ramachandran plot, 6.8% in the additional allowed region, and 0.8% in the generously 
allowed region. Glu-5 borders the generously allowed region. 

Binding Analysis. To determine the affinity of full length Pinl and its isolated 
PPIase and WW domains for a set of rhodamine-labeled peptides as shown in Table 2, 
were obtained commercially (SynPep Corporation, Dublin, CA) and allowed to interact 

25 with Pinl , PPIase and WW domains. These peptides were derived from proteins 

previously shown to interact with Pinl . While the set of peptides examined is limited, a 
consensus emerges from comparison of the highest affinity interactions. Lyophilized 
peptides were dissolved in 25 mM HEPES-Na + (pH 7.5), 100 mM NaCl, 1 mM DTT 
and stored for short periods of time at -20 °C. Peptides were labeled with the amine 

30 reactive reagent tetramethylrhodamine-5-(and-6)-isothiocyanate (Molecular Probes, Inc. 
Eugene, OR) using a 2:1 molar ratio of rhodamine to peptide in 0.1 M sodium 
bicarbonate (pH 9.0) at ambient temperature for 10 hr. Fluorescence data were collected 
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on a PTI Alphascan spectrofluorimeter (Photon Technology Instruments, Santa Clara, 
CA). Dissociation equilibrium constants for Pinl -peptide complexes were determined 
by measuring the change in fluorescence anisotropy of a set of peptide solutions at 
constant concentration of peptide each of which contained varied concentrations of Pinl 
5 using the procedure described previously (Vinson et al 9 Biochemistry 32:10871-10880, 
1998). The core binding element determined from this group of peptides is PXpSP, 
where X cannot be Gly. Surprisingly, the best binding sequences differ from the 
consensus Pinl -binding sequence of WFYpSPR determined using a degenerate P.Ser- 
Pro anchored peptide library (Yaffe et aL, Science 278:1957-1960, 1997). Furthermore, 

1 0 the dissociation constants roughly segregate into two affinity groups, lower affinity 
interactions more typical of other WW domain-peptide complexes with dissociation 
constants greater than 50 and higher affinity interactions with dissociation constants 
less then 30 |iM. Several of these peptides yielded crystalline specimens; the most well 
ordered crystals obtained thus far included the doubly phosphorylated CTD peptide, 

15 YpSPTpSPS. 

Table 2 Dissociation equilibrium constants (jiM) for Pinl-peptide complexes a 





Labeled Peptide 


Pinl 


WW 


PPIase 




WFYSPFLE (Pintide) (SEQ ID NO:5) 


>540 


n.d. 


n.d. 


20 


WFYpSPFLE (Pintide) (SEQ ID NO:6) 


17±2.0 


44±9.5 


86±11 




VPRpTPV (Cdc25c-T48) (SEQ ID NO:7) 


4.9±1.1 


7.7±3.3 


n.b. 




YLGpSPI (Cdc25c-S168) (SEQ ID NO:8) 


69±10 


91±30 


>400 




LYRpSPS (Cdc25c-S214) (SEQ ID NO:9) 


47±7.3 


72±15 


>500 




GSSpSPV (Weel-S123) (SEQ ID NO:10) 


73±4.9 


n.d. 


n.d. 


25 


PPApTPP (Mytl-T412) (SEQ ID NO: 1 1) 


12±2.1 


15±4.7 


n.b. 




PPGpSPP (Mytl-S416) (SEQ ID NO: 12) 


80±18 


126±21 


n.b. 




STSpTPR (Mytl-T455) (SEQ ID NO: 13) 


46±6.5 


39±12 


n.b. 




YSPTSPS (CTD) (SEQ ID NO: 14) 


n.b. 


n.b. 


n.b. 




YpSPTSPS (CTD-S2) (SEQ ID NO: 15) 


61±6.3 


110±26 


n.b. 


30 


YSPTpSPS (CTD-S5) (SEQ ID NO: 16) 


30±3.9 


34±5.9 


>500 




YpSPTpSPS (CTD-S2/S5) (SEQ ID NO: 17) 


10±0.83 


34±6.2 


390±82 



a Errors are deviations from theoretical binding isotherms. All peptides are derived 
from human proteins except for Pintide. n.b. is used when there is no detectable 
35 binding, n.d. stands for not determined. The phosphorylation sites are indicated with 
a single letter amino acid code and numbered according to sequences appearing in 
public databases. 
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Overall architecture. To map the recognition surface of Pinl for the CTD of 
hyperphosphorylated Pol IIo, we next carried out structural analysis of Pinl bound to a 
phosphorylated heptad repeat of the CTD, N-YpSPTpSPS-C. Crystals of the Pinl- 
YpSPTpSPS complex were obtained and the structure solved by molecular replacement 
using the published model of Pinl (Ranganathan et al, Cell 89:875-886, 1997) and 
refined to 1.84 A resolution. The CTD phosphopeptide resides in the cavity separating 
the PPIase domain from the WW domain. Electrostatic, hydrogen bonding, and van der 
Waals contacts to residues projecting outward from the concave surface of Pinl 's WW 
domain constitute the Pinl-peptide binding interface (Fig. 1 A). No contacts between the 
bound peptide and the Pinl PPIase domain are seen in this crystal structure. 

Two conformational differences exist in the current Pinl-peptide co-crystal 
structure relative to the previously reported structure (Ranganathan et al., Cell 89:875- 
886, 1997). The first change occurs in the PPIase domain. A nearly 70° rotation of the 
(31 -al (residues 64-80) catalytic loop results in an exposed PPIase domain active site. 
Arg-68 and Arg-69, which confer preferential binding of phosphorylated substrates to 
the Pinl PPIase domain, now reside well outside of the proline ring binding pocket 
formed by His-59, Cys-113, Leu-122, Met-130, Phe-134, andHis-157 (Figs. 1A-B). 
While a crystal packing interaction maintains this open active site architecture, the 
observed orientation of the pi-al catalytic loop likely reflects the inherent mobility of 
this selectivity filter when not engaged with a bound phosphopeptide. 

A second conformational change in the Pinl WW domain results in an 
exaggerated twist in the P-sheet concurrent with a contraction of the concave WW 
domain ligand binding surface. The upper third of p2' and p3' together with the p27p3' 
turn moves downward over the amino terminal half of the bound CTD peptide. This 
movement positions Arg-14 and Phe-25 in van der Waals contact with Pro-3' of the 
CTD peptide and Ser-32 within hydrogen bonding distance of the backbone carbonyl 
oxygen of Thr-4' on the CTD peptide. The lower half of the pi ' and p2' strands 
together with their connecting loop pivots upward positioning Ser-16 and Arg-17 for 
hydrogen bonding to the phosphate on P.Ser-5' of the CTD peptide. The complete 
phosphate binding module of the Pinl WW domain encompasses the side chains of Ser- 
16, Arg-17, and Tyr-23 and the backbone amide of Arg-17. A bound water molecule 
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mediates the hydrogen bond between Tyr-23 and the serine phosphate. The Tyr-23/Trp- 
34 aromatic pair spatially defines the pivot point for all of these structural changes. 

The aromatic rings of these residues align edge-to-face forming an aromatic 
clamp which accomodates the backbone atoms of Thr-4' and P.Ser-5' and the ring atoms 
of Pro-6' of the CTD peptide. Finally, the Ser-7' side chain hydroxyl group forms a 
hydrogen bond with the protonated indole nitrogen of Trp-34 (Fig. 2 A). Tyr-1 ' and 
P.Ser-2' of the CTD peptide exhibit a greater degree of flexibility reflected by the weak 
electron density associated with both of these CTD residues. The entire peptide binds as 
an extended coil with both P.Ser-Pro peptide bonds in the trans configuration. Modeling 
of either peptide bond in the cis configuration results in significant steric perturbations at 
the ligand binding interface that would likely result in a loss of peptide binding. This 
three-dimensional view supports an extended contact surface limited to five consecutive 
ligand residues with WW domains. 

Energetic significance of protein-lig and interactions. To more fully elaborate the 
WW domain peptide-binding interface quantitatively, the affinities of a limited series of 
Pinl mutants for the CTD peptide were measured by fluorescence anisotropy. The 
results of this analysis are summarized schematically in Figure 3. Replacement of Arg- 
17 by Ala results in a 6-fold decrease in binding affinity. In R17A Pinl, the remaining 
hydrogen bonds between the phosphate on Ser-5' and the side-chain hydroxyl group of 
Ser-16, the backbone amide of Ala- 17, and the water-mediated contact with the Tyr-23 
hydroxyl moiety provide an interface for low affinity binding of the phosphopeptide. E. 
nidulans Pinl contains an Asn residue in place of Arg-17 (Fig. 4 A). While neutral, the 
amide side chain at this position can act as a hydrogen bond donor to the P.Ser 
phosphate. Therefore, WW domains possessing neutral hydrogen bond donors at 
positions equivalent to 16, 17, and 23 in Pinl may provide an energetically acceptable 
interface for P.Ser/P.Thr-Pro recognition. WW4 of the Nedd4-like HECT domain 
ubiquitin ligase WWP1 contains a conserved Thr-Arg segment homologous to Ser-16 
and Arg-17 of Pinl (Fig. 4A). Binding analysis of isolated WW1, WW3, and WW4 of 
WWP1 demonstrates phosphorylation-dependent binding only to WW4. Moreover, 
several HECT domain ubiquitin ligase WW domains including WW3 of Rsp5 contain an 
equivalent hydrogen-bonding motif (Thr, Asn, and Tyr - Fig. 4A). In addition to the 
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group IV WW domains, a number of phosphoprotein-binding domains exist including 
SH2, PTB, STYX, SBF, 14-3-3, and FHA domains (Plowman et aL 9 Proc. Natl. Acad. 
Sci. USA 96:13603-13610, 1999). While structurally distinct, nearly all such domains 
utilize Arg residues for the selective recognition of pSer, pThr, and pTyr2 1,22,23 side 
5 chains in a manner analogous to Pinl 's WW domain. 

Mutations of the other two side chain hydrogen bond donors, Ser-16 and Tyr-23 
to Ala and Phe, respectively, each result in a 2.5-fold decrease in binding affinity. Trp- 
34 constitutes one of two conserved Trp residues found in nearly all WW domains 
described to date. Together with Tyr-23, this aromatic pair organizes around the Thr-4' 

10 and P.Ser-5 5 backbone, and Pro-6' of the CTD peptide. Replacement of Trp-34 by Phe 
and Ala cause a 6-fold and 1 8-fold reduction in binding affinity, respectively. The 
W34A mutant phenotype is consistent with the reduction of side chain volume leading to 
less efficient packing of a portion of the CTD peptide backbone and the Pro ring. All 
Pinl homologs described to date include one additional residue in the turn linking pi ' 

15 and p2' (Fig. 4A). This unique structural feature of group IV WW domains may 
facilitate the conformational change in the pi 7p2' turn that is necessary for the 
formation of the phosphate binding pocket. 

Surprisingly, mutation of Arg- 14 and Phe-25, which are predicted to be 
energetically important based upon the co-crystal structure, results in enhanced binding 

20 affinity upon replacement. Structural analysis of the R14A Pinl mutant without a 
peptide bound to its WW domain suggests that the WW domain exists in the 
conformation observed in the current Pinl-phosphopeptide complex rather than that of 
the wild type unliganded Pinl complex. The enhanced binding affinity observed for the 
R14A, F25L, F25V, and F25A mutants might be due in part to preferential stabilization 

25 of the peptide-binding WW domain conformation in the absence of ligand. 

Comparison with other WW domains. The NMR-derived structure of the 
YAP65 WW domain complexed with a PPxY-containing peptide had a structurally 
distinct binding interface from that of the CTD heptide bound Pinl WW domain (Fig. 
4b) (Macias et al 9 Nature 382:646-649, 1996). While this may reflect distinct 
30 mechanisms for ligand recognition by group 1 and group IV WW domains, the recent 
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structural analysis of the dystrophin WW domain in complex with a PPxY-containing 
peptide argues against this (Huang et al, Nature, submitted (2000)). Modeling the 
peptide bound to YAP65 in a manner similar to that observed in the Pinl-CTD complex 
results in a peptide orientation superimposed on the extended binding interface observed 
for Pinl . However, the direction of the polypeptide chain is reversed. This capacity to 
bind protein ligands in a bi-directional manner is consistent with the dystrophin WW 
domain structure and is reminiscent of the alternative binding modes utilized by SH3 
domains (Kuriyan, J., Cowburn, D., Annu. Rev. Biophys. Biomol. Struct. 26:259-288, 
1997). While Pinl and dystrophin bind their respective ligands in the opposite N- to C- 
terminal direction, apart from Pinl 's ability to bind phosphopeptides, the chemical 
features of the peptide-binding interface are nearly identical. 

The picture that emerges is of a rather limited WW domain contact surface that 
relies on a set of energetically modest side chain interactions, none of which is 
absolutely essential for ligand binding. Rather, the summed contributions of this module 
with ligands spanning five consecutive amino acid residues together with interactions 
with additional modular domains and longer polypeptide targets likely contribute to 
target selection in WW domain containing proteins such as Pinl. 

While the invention has been described in detail with reference to certain 
preferred embodiments thereof, it will be understood that modifications and variations 
are within the spirit and scope of that which is described and claimed. 



